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JVIOLECULES FOR DIAGNOSTICS AN DJTHER APEUT ICS 

TECHNICAL FIELD 

The present invention relates to human molecules and to the use of these sequences in the 
5 diagnosis, study, prevention, and treatment of diseases associated with, as well as effects of exogenous 
compounds on, the expression of human molecules. 

BACKGROUND OF THE INVENTION 

The human genome is comprised of thousands of genes, many encoding gene products that 

1 o function in the maintenance and growth of the various cells and tissues in the body. Aberrant 

expression or mutations in these genes and their products is the cause of, or is associated with, a variety 
of human diseases such as cancer and other cell proliferative disorders, autoimmune/inflammatory 
disorders, infections, developmental disorders, endocrine disorders, metabolic disorders, neurological 
disorders, gastrointestinal disorders, transport disorders, and connective tissue disorders. The 
1 5 identification of these genes and their products is the basis of an ever-expanding effort to find markers 
for early detection of diseases, and targets for their prevention and treatment. Therefore, these genes 
and their products are useful as diagnostics and therapeutics. These genes may encode, for example, 
enzyme molecules, molecules associated with growth and development, biochemical pathway molecules, 
extracellular information transmission molecules, receptor molecules, intracellular signaling molecules, 

2 o membrane transport molecules, protein modification and maintenance molecules, nucleic acid synthesis 

and modification molecules, adhesion molecules, antigen recognition molecules, secreted and 
extracellular matrix molecules, cytoskeletal molecules, ribosomal molecules, electron transfer 
associated molecules, transcription factor molecules, chromatin molecules, cell membrane molecules, 
and organelle associated molecules. 

2 5 For example, cancer represents a type of cell proliferative disorder that affects nearly every 

tissue in the body. A wide variety of molecules, either aberrantly expressed or mutated, can be the 
cause of, or involved with, various cancers because tissue growth involves complex and ordered 
patterns of cell proliferation, cell differentiation, and apoptosis. Cell proliferation must be regulated to 
maintain both the number of cells and their spatial organization. This regulation depends upon the 

3 o appropriate expression of proteins which control cell cycle progression in response to extracellular 

signals such as growth factors and other mitogens, and intracellular cues such as DNA damage or 
nutrient starvation. Molecules which directly or indirectly modulate cell cycle progression fall into 
several categories, including growth factors and their receptors, second messenger and signal 
transduction proteins, oncogene products, tumor-suppressor proteins, and mitosis-promoting factors. 
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Aberrant expression or mutations in any of these gene products can result in cell proliferative disorders 
such as cancer. Oncogenes are genes generally derived from normal genes that, through abnormal 
expression or mutation, can effect the transformation of a normal cell to a malignant one (oncogenesis). 
Oncoproteins, encoded by oncogenes, can affect cell proliferation in a variety of ways and include 
5 growth factors, growth factor receptors, intracellular signal transducers, nuclear transcription factors, 
and cell-cycle control proteins. In contrast, tumor-suppressor genes are involved in inhibiting cell 
proliferation. Mutations which cause reduced function or loss of function in tumorrsuppressor genes 
result in aberrant cell proliferation and cancer. Although many different genes and their products have 
been found to be associated with cell proliferative disorders such as cancer, many more may exist that 

1 o are yet to be discovered. 

DNA-based arrays can provide a simple way to explore the expression of a single polymorphic 
gene or a large number of genes. When the expression of a single gene is explored, DNA-based arrays 
are employed to detect the expression of specific gene variants. For example, a p53 tumor suppressor 
gene array is used to determine whether individuals are carrying mutations that predispose them to 
15 cancer. A cytochrome p450 gene array is usefiil to determine whether individuals have one of a number 
of specific mutations that could result in increased drug metabolism, drug resistance or drug toxicity. 

DNA-based array technology is especially relevant for the rapid screening of expression of a 
large number of genes. There is a growing awareness that gene expression is affected in a global 
fashion. A genetic predisposition, disease or therapeutic treatment may affect, directly or indirectly, the 

2 o expression of a large number of genes. In some cases the interactions may be expected, such as when 

the genes are part of the same signaling pathway. In other cases, such as when the genes participate in 
separate signaling pathways, the interactions may be totally unexpected. Therefore, DNA-based arrays 
can be used to investigate how genetic predisposition, disease, or therapeutic treatment affects the 
expression of a large number of genes. 

25 

Enzyme Molecules 

SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, and SEQ ID NO:8 encode, for example, human enzyme molecules. 

The cellular processes of biogenesis and biodegradation involve a number of key enzyme 
30 classes including oxidoreductases, transferases, hydrolases, lyases, isomerases, and ligases. These 
enzyme classes are each comprised of numerous substrate-specific enzymes having precise and well 
regulated functions. These enzymes function by facilitating metabolic processes such as glycolysis, 
the tricarboxylic cycle, and fatty acid metabolism; synthesis or degradation of amino acids, steroids, 
phospholipids, alcohols, etc.; regulation of cell signalling, proliferation, inflamation, apoptosis, etc., 
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and through catalyzing critical steps in DNA replication and repair, and the process of translation. 
Oxidoreductases 

Many pathways of biogenesis and biodegradation require oxidoreductase (dehydrogenase or 
reductase) activity, coupled to the reduction or oxidation of a donor or acceptor cofactor. Potential 
5 cofactors include cytochromes, oxygen, disulfide, iron-sulfur proteins, flavin adenine dinucleotide 
(FAD), and the nicotinamide adenine dinucleotides NAD and NADP (Newsholme, E. A. and A.R. 
Leech (1983) Biochemistry for the Medical Sciences . John Wiley and Sons, Chichester, U.K., pp. 
779-793). Reductase activity catalyzes the transfer of electrons between substrate(s) and cofactor(s) 
with concurrent oxidation of the cofactor. The reverse dehydrogenase reaction catalyzes the reduction 

10 of a cofactor and consequent oxidation of the substrate. Oxidoreductase enzymes are a broad 
superfamily of proteins that catalyze numerous reactions in all cells of organisms ranging from 
bacteria to plants to humans. These reactions include metabolism of sugar, certain detoxification 
reactions in the liver, and the synthesis or degradation of fatty acids, amino acids, glucocorticoids, 
estrogens, androgens, and prostaglandins. Different family members are named according to the 

15 direction in which their reactions are typically catalyzed; thus they may be referred to as 

oxidoreductases, oxidases, reductases, or dehydrogenases. In addition, family members often have 
distinct cellular localizations, including the cytosol, the plasma membrane, mitochondrial inner or 
outer membrane, and peroxisomes. 

Short-chain alcohol dehydrogenases (SC ADs) are a family of dehydrogenases that only share 

20 15% to 30% sequence identity, with similarity predominantly in the coenzyme binding domain and 
the substrate binding domain. In addition to the well-known role in detoxification of ethanol, SCADs 
are also involved in synthesis and degradation of fatty acids, steroids, and some prostaglandins, and 
are therefore implicated in a variety of disorders such as lipid storage disease, myopathy, SCAD 
deficiency, and certain genetic disorders. For example, retinol dehydrogenase is a SCAD-family 

25 member (Simon, A. et al. (1995) J. Biol. Chem. 270:1 107-1 112) that converts retinol to retinal, the 
precursor of retinoic acid. Retinoic acid, a regulator of differentiation and apoptosis, has been shown 
to down-regulate genes involved in cell proliferation and inflammation (Chai, X. et al. (1995) J. Biol. 
Chem. 270:3900-3904). In addition, retinol dehydrogenase has been linked to hereditary eye diseases 
such as autosomal recessive childhood-onset severe retinal dystrophy (Simon, A. et al. (1996) 

30 Genomics 36:424-430). 

Propagation of nerve impulses, modulation of cell proliferation and differentiation, induction 
of the immune response, and tissue homeostasis involve neurotransmitter metabolism (Weiss, B. 
(1991) Neurotoxicology 12:379-386; Collins, S.M. et al. (1992) Ann. N.Y. Acad. Sci. 664:415-424; 
Brown, J.K. and H. Imam (1991) J. Inherit Metab. Dis. 14:436-458). Many pathways of 

35 neurotransmitter metabolism require oxidoreductase activity, coupled to reduction or oxidation of a 
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cofactor, such as NAD7NADH (Newsholme, E.A. and A.R. Leech (1983) Biochemistry for the 
Medical Sciences . John Wiley and Sons, Chichester, U.K. pp. 779-793). Degradation of 
catecholamines (epinephrine or norepinephrine) requires alcohol dehydrogenase (in the brain) or 
aldehyde dehydrogenase (in peripheral tissue). NAD + -dependent aldehyde dehydrogenase oxidizes 5- 
5 hydroxyindole-3-acetate (the product of 5-hydroxytryptamine (serotonin) metabolism) in the brain, 
blood platelets, liver and pulmonary endothelium (Newsholme, supra , p. 786). Other 
neurotransmitter degradation pathways that utilize N AD + /NADH-dependent oxidoreductase activity 
include those of L-DOPA (precursor of dopamine, a neuronal excitatory compound), glycine (an 
inhibitory neurotransmitter in the brain and spinal cord), histamine (liberated from mast cells during 

10 the inflammatory response), and taurine (an inhibitory neurotransmitter of the brain stem, spinal cord 
and retina) (Newsholme, supra , pp. 790, 792). Epigenetic or genetic defects in neurotransmitter 
metabolic pathways can result in a spectrum of disease states in different tissues including Parkinson 
disease and inherited myoclonus (McCance, K.L. and S.E. Huether (1994) Pathophysiology . Mosby- 
Year Book, Inc., St. Louis MO, pp. 402-404; Gundlach, A.L. (1990) FASEB J. 4:2761-2766). 

15 Tetrahydrofolate is a derivatized glutamate molecule that acts as a carrier, providing activated 

one-carbon units to a wide variety of biosynthetic reactions, including synthesis of purines, 
pyrimidines, and the amino acid methionine. Tetrahydrofolate is generated by the activity of a 
holoenzyme complex called tetrahydrofolate synthase, which includes three enzyme activities: 
tetrahydrofolate dehydrogenase, tetrahydrofolate cyclohydrolase, and tetrahydrofolate synthetase. 

2 o Thus, tetrahydrofolate dehydrogenase plays an important role in generating building blocks for 
nucleic and amino acids, crucial to proliferating cells. 

3-Hydroxyacyl-CoA dehydrogenase (3HACD) is involved in fatty acid metabolism. It 
catalyzes the reduction of 3-hydroxyacyl-CoA to 3-oxoacyl-CoA, with concomitant oxidation of 
NAD to NADH, in the mitochondria and peroxisomes of eukaryotic cells. In peroxisomes, 3HACD 

2 5 and enoyl-CoA hydratase form an enzyme complex called bifunctional enzyme, defects in which are 

associated with peroxisomal bifunctional enzyme deficiency. This interruption in fatty acid 
metabolism produces accumulation of very-long chain fatty acids, disrupting development of the 
brain, bone, and adrenal glands. Infants born with this deficiency typically die within 6 months 
(Watkins, P. et al. (1989) J. Clin. Invest. 83:771-777; Online Mendelian Inheritance in Man (OMIM), 
30 #261515). The neurodegeneration that is characteristic of Alzheimer's disease involves development 
of extracellular plaques in certain brain regions. A major protein component of these plaques is the 
peptide amyloid-P (AP), which is one of several cleavage products of amyloid precursor protein 
(APP). 3HACD has been shown to bind the AP peptide, and is overexpressed in neurons affected in 
Alzheimer* s disease. In addition, an antibody against 3HACD can block the toxic effects of Ap in a 

3 5 cell culture model of Alzheimer's disease (Yan, S. et al. (1997) Nature 389:689-695; OMIM, 
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#602057). 

Steroids, such as estrogen, testosterone, corticosterone, and others, are generated from a 
common precursor, cholesterol, and are interconverted into one another. A wide variety of enzymes 
act upon cholesterol, including a number of dehydrogenases. Steroid dehydrogenases, such as the 
5 hydroxysteroid dehydrogenases, are involved in hypertension, fertility, and cancer (Duax, W.L. and 
D. Ghosh (1997) Steroids 62:95-100). One such dehydrogenase is 3-oxo-5-oc-steroid dehydrogenase 
(OASD), a microsomal membrane protein highly expressed in prostate and other androgen-responsive 
tissues. OASD catalyzes the conversion of testosterone into dihydrotestosterone, which is the most 
potent androgen. Dihydrotestosterone is essential for the formation of the male phenotype during 

10 embryogenesis, as well as for proper androgen-mediated growth of tissues such as the prostate and 
male genitalia. A defect in OASD that prevents the conversion of testosterone into 
dihydrotestosterone leads to a rare form of male pseudoheimaphroditis, characterized by defective 
formation of the external genitalia (Andersson, S. et al. (1991) Nature 354:159-161; Labrie, F. et al. 
(1992) Endocrinology 131:1571-1573; OMIM #264600). Thus, OASD plays a central role in sexual 

1 5 differentiation and androgen physiology. 

17P-hydroxy steroid dehydrogenase (17PHSD6) plays an important role in the regulation of 
the male reproductive hormone, dihydrotestosterone (DHTT). 17pHSD6 acts to reduce levels of 
DHTT by oxidizing a precursor of DHTT, 3a-diol, to androsterone which is readily glucuronidated 
and removed from tissues. 17PHSD6 is active with both androgen and estrogen substrates when 

20 expressed in embryonic kidney 293 cells. At least five other isozymes of 17pHSD have been 

identified that catalyze oxidation and/or reduction reactions in various tissues with preferences for 
different steroid substrates (Biswas, M.G. and D.W. Russell (1997) J. Biol. Chem. 272:15959- 
15966). For example, 17PHSD1 preferentially reduces estradiol and is abundant in the ovary and 
placenta. 17PHSD2 catalyzes oxidation of androgens and is present in the endometrium and placenta. 

25 17pHSD3 is exclusively a reductive enzyme in the testis (Geissler, W.M. et al. (1994) Nat. Genet. 
7:34-39). An excess of androgens such as DHTT can contribute to certain disease states such as 
benign prostatic hyperplasia and prostate cancer. 

Oxidoreductases are components of the fatty acid metabolism pathways in mitochondria and 
peroxisomes. The main beta-oxidation pathway degrades both saturated and unsaturated fatty acids, 

3 o while the auxiliary pathway perf orms additional steps required for the degradation of unsaturated fatty 
acids. The auxiliary beta-oxidation enzyme 2,4-dienoyl-CoA reductase catalyzes the removal of 
even-numbered double bonds from unsaturated fatty acids prior to their entry into the main beta- 
oxidation pathway. The enzyme may also remove odd-numbered double bonds from unsaturated 
fatty acids (Koivuranta, K.T. et al. (1994) Biochem. J. 304:787-792; Smeland, T.E. et al. (1992) Proc. 

3 5 Natl. Acad. Sci. USA 89:6673-6677). 2,4-dienoyl-CoA reductase is located in both mitochondria and 
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peroxisomes. Inherited deficiencies in mitochondrial and peroxisomal beta-oxidation enzymes are 
associated with severe diseases, some of which manifest themselves soon after birth and lead to death 
within a few years. Defects in beta-oxidation are associated with Reye's syndrome, Zellweger 
syndrome, neonatal adrenoleukodystrophy, infantile Refsum's disease, acyl-CoA oxidase deficiency, 
5 and bifunctional protein deficiency (Suzuki, Y. et al. (1994) Am. J. Hum. Genet. 54:36-43; Hoefler, 
supra; Cotran, R.S. et al. (1994) Robbins Pathologic Basis of Disease , W.B. Saunders Co., 
Philadelphia PA, p.866). Peroxisomal beta-oxidation is impaired in cancerous tissue. Although 
neoplastic human breast epithelial cells have the same number of peroxisomes as do normal cells, 
fatty acyl-CoA oxidase activity is lower than in control tissue (el Bouhtoury, F. et al. (1992) J. Pathol. 

10 1 66:27-35). Human colon carcinomas have fewer peroxisomes than normal colon tissue and have 
lower fatty-acyl-CoA oxidase and bifunctional enzyme (including enoyl-CoA hydratase) activities 
than normal tissue (Cable, S. et al. (1992) Virchows Arch. B Cell Pathol. Incl. Mol. Pathol. 62:221- 
226). Another important oxidoreductase is isocitrate dehydrogenase, which catalyzes the conversion 
of isocitrate to a-ketoglutarate, a substrate of the citric acid cycle. Isocitrate dehydrogenase can be 

15 either NAD or NADP dependent, and is found in the cytosol, mitochondria, and peroxisomes. 
Activity of isocitrate dehydrogenase is regulated developmentally, and by hormones, 
neurotransmitters, and growth factors. 

Hydroxypyruvate reductase (HPR), a peroxisomal 2-hydroxyacid dehydrogenase in the 
glycolate pathway, catalyzes the conversion of hydroxypyruvate to glycerate with the oxidation of 

20 both NADH and NADPH. The reverse dehydrogenase reaction reduces NAD* and NADP+. HPR 
recycles nucleotides and bases back into pathways leading to the synthesis of ATP and GTP. ATP 
and GTP are used to produce DNA and RN A and to control various aspects of signal transduction and 
energy metabolism. Inhibitors of purine nucleotide biosynthesis have long been employed as 
antiproliferative agents to treat cancer and viral diseases. HPR also regulates biochemical synthesis 

25 of serine and cellular serine levels available for protein synthesis. 

The mitochondrial electron transport (or respiratory) chain is a series of oxidoreductase-type 
enzyme complexes in the mitochondrial membrane that is responsible for the transport of electrons 
from NADH through a series of redox centers within these complexes to oxygen, and the coupling of 
this oxidation to the synthesis of ATP (oxidative phosphorylation). ATP then provides the primary 

30 source of energy for driving a cell's many energy-requiring reactions. The key complexes in the 
respiratory chain are NADH ubiquinone oxidoreductase (complex I), succinate:ubiquinone 
oxidoreductase (complex II), cytochrome c r b oxidoreductase (complex III), cytochrome c oxidase 
(complex IV), and ATP synthase (complex V) (Alberts, B. et al. (1994) Molecular Biology of the 
Cell , Garland Publishing, Inc., New York NY, pp. 677-678). All of these complexes are located on 

3 5 the inner matrix side of the mitochondrial membrane except complex II, which is on the cytosolic 
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side. Complex II transports electrons generated in the citric acid cycle to the respiratory chain. The 
electrons generated by oxidation of succinate to fumarate in the citric acid cycle are transferred 
through electron carriers in complex II to membrane bound ubiquinone (Q). Transcriptional 
regulation of these nuclear-encoded genes appears to be the predominant means for controlling the 
5 biogenesis of respiratory enzymes. Defects and altered expression of enzymes in the respiratory chain 
are associated with a variety of disease conditions. 

Other dehydrogenase activities using NAD as a cofactor are also important in mitochondrial 
function. 3-hydroxyisobutyrate dehydrogenase (3HBD), important in valine catabolism, catalyzes the 
NAD-dependent oxidation of 3-hydroxyisobutyrate to methylmalonate semialdehyde within 

10 mitochondria. Elevated levels of 3-hydroxyisobutyrate have been reported in a number of disease 
states, including ketoacidosis, methylmalonic acidemia, and other disorders associated with 
deficiencies in methylmalonate semialdehyde dehydrogenase (Rougraff, P.M. et al. (1989) J. Biol. 
Chem. 264:5899-5903). 

Another mitochondrial dehydrogenase important in amino acid metabolism is the enzyme 

is isovaleryl-CoA-dehydrogenase (IVD). IVD is involved in leucine metabolism and catalyzes the 

oxidation of isovaleryl-CoA to 3-methylcrotonyl-CoA Human IVD is a tetrameric flavoprotein that 
is encoded in the nucleus and synthesized in the cytosol as a 45 kDa precursor with a mitochondrial 
import signal sequence. A genetic deficiency, caused by a mutation in the gene encoding IVD, results 
in the condition known as isovaleric acidemia. This mutation results in inefficient mitochondrial 

20 import and processing of the IVD precursor (Vockley, J. et al. (1992) J. Biol. Chem. 267:2494-2501). 
Transferases 

Transferases are enzymes that catalyze the transfer of molecular groups. The reaction may 
involve an oxidation, reduction, or cleavage of covalent bonds, and is often specific to a substrate or 
to particular sites on a type of substrate. Transferases participate in reactions essential to such 

25 functions as synthesis and degradation of cell components, regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretion and excretion. Transferases are 
involved in key steps in disease processes involving these functions. Transferases are frequently 
classified according to the type of group transferred. For example, methyl transferases transfer one- 
carbon methyl groups, amino transferases transfer nitrogenous amino groups, and similarly 

3 o denominated enzymes transfer aldehyde or ketone, acyl, glycosyl, alkyl or aryl, isoprenyl, saccharyl, 
phosphorous-containing, sulfur-containing, or selenium-containing groups, as well as small 
enzymatic groups such as Coenzyme A. 

Acyl transferases include peroxisomal carnitine octanoyl transferase, which is involved in the 
fatty acid beta-oxidation pathway, and mitochondrial carnitine palmitoyl transferases, involved in 

3 5 fatty acid metabolism and transport. Choline O-acetyl transferase catalyzes the biosynthesis of the 

7 
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neurotransmitter acetylcholine. 

Amino transferases play key roles in protein synthesis and degradation, and they contribute to 
other processes as well. For example, the amino transferase 5- aminolevulinic acid synthase catalyzes 
the addition of succinyl-CoA to glycine, the first step in heme biosynthesis. Other amino transferases 
5 participate in pathways important for neurological function and metabolism. For example, glutamine- 
phenylpyruvate amino transferase, also known as glutamine transaminase K (GTK), catalyzes several 
reactions with a pyridoxal phosphate cofactor. GTK catalyzes the reversible conversion of L- 
glutamine and phenylpyruvate to 2-oxoglutaramate and L-phenylalanine. Other amino acid substrates 
for GTK include L-methionine, L-histidine, and L-tyrosine. GTK also catalyzes the conversion of 

10 kynurenine to kynurenic acid, a tryptophan metabolite that is an antagonist of the N-methyl-D- 

aspartate (NMDA) receptor in the brain and may exert a neuromodulatory function. Alteration of the 
kynurenine metabolic pathway may be associated with several neurological disorders. GTK also 
plays a role in the metabolism of halogenated xenobiotics conjugated to glutathione, leading to 
nephrotoxicity in rats and neurotoxicity in humans. GTK is expressed in kidney, liver, and brain. 

1 5 Both human and rat GTKs contain a putative pyridoxal phosphate binding site (ExPASy ENZYME: 
EC 2.6.1.64; Perry, S.J. et al. (1993) Mol. Pharmacol. 43:660-665; Perry, S. et al. (1995) FEBS Lett 
360:277-280; and Alberati-Giani, D. et al. (1995) J. Neurochem. 64:1448-1455). A second amino 
transferase associated with this pathway is kynurenine/cc-aminoadipate amino transferase (AadAT). 
AadAT catalyzes the reversible conversion of a-aminoadipate and a-ketoglutarate to a-ketoadipate 

2 o and L-glutamate during lysine metabolism. AadAT also catalyzes the transamination of kynurenine 

to kynurenic acid. A cytosolic AadAT is expressed in rat kidney, liver, and brain (Nakatani, Y. et al. 
(1970) Biochim. Biophys. Acta 198:219-228; Buchli, R. et al. (1995) J. Biol. Chem. 270:29330- 
29335). 

Glycosyl transferases include the mammalian UDP-glucouronosyl transferases, a family of 
25 membrane-bound microsomal enzymes catalyzing the transfer of glucouronic acid to lipophilic 

substrates in reactions that play important roles in detoxification and excretion of drugs, carcinogens, 
and other foreign substances. Another mammalian glycosyl transferase, mammalian UDP-galactose- 
ceramide galactosyl transferase, catalyzes the transfer of galactose to ceramide in the synthesis of 
galactocerebrosides in myelin membranes of the nervous system. The UDP-glycosyl transferases 
30 share a conserved signature domain of about 50 amino acid residues (PROSITE: PDOC00359, 
http://expasy .hcuge.ch/sprot4>rosite.html) . 

Methyl transferases are involved in a variety of pharmacologically important processes. 
Nicotinamide N-methyl transferase catalyzes the N-methylation of nicotinamides and other pyridines, 
an important step in the cellular handling of drugs and other foreign compounds. 

3 5 Phenylethanolamine N-methyl transferase catalyzes the conversion of noradrenalin to adrenalin. 6-0- 
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methylguanine-DNA methyl transferase reverses DNA methylation, an important step in 
carcinogenesis. Uroporphyrin-M C-methyl transferase, which catalyzes the transfer of two methyl 
groups from S-adenosyl-L-methionine to uroporphyrinogen III, is the first specific enzyme in the 
biosynthesis of cobalamin, a dietary enzyme whose uptake is deficient in pernicious anemia. Protein- 
5 arginine methyl transferases catalyze the posttranslational methylation of arginine residues in 
proteins, resulting in the mono- and dimethylation of arginine on the guanidino group. Substrates 
include histones, myelin basic protein, and heterogeneous nuclear ribonucleoproteins involved in 
mRNA processing, splicing, and transport. Protein-arginine methyl transferase interacts with proteins 
upregulated by mitogens, with proteins involved in chronic lymphocytic leukemia, and with 

10 interferon, suggesting an important role for methylation in cytokine receptor signaling (Lin, W.-J. et 
aL (1996) J. Biol. Chem. 271:15034-15044; Abramovich, C. et al. (1997) EMBO J. 16:260-266; and 
Scott, H.S. et al. (1998) Genomics 48:330-340). 

Phosphotransferases catalyze the transfer of high-energy phosphate groups and are important 
in energy-requiring and -releasing reactions. The metabolic enzyme creatine kinase catalyzes the 

15 reversible phosphate transfer between creatine/creatine phosphate and ATP/ADP. Glycocy amine 
kinase catalyzes phosphate transfer from ATP to guanidoacetate, and arginine kinase catalyzes 
phosphate transfer from ATP to arginine. A cysteine-containing active site is conserved in this family 
(PROS1TE: PDOC00103). 

Prenyl transferases are heterodimers, consisting of an alpha and a beta subunit, that catalyze 

2 o the transfer of an isoprenyl group. An example of a prenyl transferase is the mammalian protein 

farnesyl transferase. The alpha subunit of farnesyl transferase consists of 5 repeats of 34 amino acids 
each, with each repeat containing an invariant tryptophan (PROSITE: PDOC00703). 

Saccharyl transferases are glycating enzymes involved in a variety of metabolic processes. 
Oligosacchryl transferase-48, for example, is a receptor for advanced glycation endproducts. 
25 Accumulation of these endproducts is observed in vascular complications of diabetes, macrovascular 
disease, renal insufficiency, and Alzheimer's disease (Thornalley, P.J. (1998) Cell Mol. Biol. (Noisy- 
Le-Grand) 44:1013-1023). 

Coenzyme A (CoA) transferase catalyzes the transfer of CoA between two carboxylic acids. 
Succinyl CoA:3-oxoacid CoA transferase, for example, transfers CoA from succinyl-CoA to a 

3 o recipient such as acetoacetate. Acetoacetate is essential to the metabolism of ketone bodies, which 

accumulate in tissues affected by metabolic disorders such as diabetes (PROSITE: PDOC00980). 
Hydrolases 

Hydrolysis is the breaking of a covalent bond in a substrate by introduction of a molecule of 
water. The reaction involves a nucleophilic attack by the water molecule's oxygen atom on a target 
3 5 bond in the substrate. The water molecule is split across the target bond, breaking the bond and 
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generating two product molecules. Hydrolases participate in reactions essential to such functions as 
synthesis and degradation of cell components, and for regulation of cell functions including cell 
signaling, cell proliferation, inflamation, apoptosis, secretion and excretion. Hydrolases are involved 
in key steps in disease processes involving these functions. Hydrolytic enzymes, or hydrolases, may 
5 be grouped by substrate specificity into classes including phosphatases, peptidases, 
lysophospholipases, phosphodiesterases, glycosidases, and glyoxalases. 

Phosphatases hydrolytically remove phosphate groups from proteins, an energy-providing 
step that regulates many cellular processes, including intracellular signaling pathways that in turn 
control cell growth and differentiation, cell-cell contact, the cell cycle, and oncogenesis. 

10 Lysophospholipases (LPLs) regulate intracellular lipids by catalyzing the hydrolysis of ester 

bonds to remove an acyl group, a key step in lipid degradation. Small LPL isoforms, approximately 
15-30 kD, function as hydrolases; larger isoforms function both as hydrolases and transacylases. A 
particular substrate for LPLs, lysophosphatidylcholine, causes lysis of cell membranes. LPL activity 
is regulated by signaling molecules important in numerous pathways, including the inflammatory 

15 response. 

Peptidases, also called proteases, cleave peptide bonds that form the backbone of peptide or 
protein chains. Proteolytic processing is essential to cell growth, differentiation, remodeling, and 
homeostasis as well as inflammation and immune response. Since typical protein half-lives range 
from hours to a few days, peptidases are continually cleaving precursor proteins to their active form, 

20 removing signal sequences from targeted proteins, and degrading aged or defective proteins. 

Peptidases function in bacterial, parasitic, and viral invasion and replication within a host. Examples 
of peptidases include trypsin and chymotrypsin (components of the complement cascade and the 
blood-clotting cascade) lysosomal cathepsins, calpains, pepsin, renin, and chymosin (Beynon, R. J. 
and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University Press, New 

25 York NY, pp. 1-5). 

Hie phosphodiesterases catalyze the hydrolysis of one of the two ester bonds in a 
phosphodiester compound. Phosphodiesterases are therefore crucial to a variety of cellular processes. 
Phosphodiesterases include DNA and RNA endo- and exo-nucleases, which are essential to cell 
growth and replication as well as protein synthesis. Another phosphodiesterase is acid 

3 o sphingomyelinase, which hydrolyzes the membrane phospholipid sphingomyelin to ceramide and 
phosphorylcholine. Phosphorylcholine is used in the synthesis of phosphatidylcholine, which is 
involved in numerous intracellular signaling pathways. Ceramide is an essential precursor for the 
generation of gangliosides, membrane lipids found in high concentration in neural tissue. Defective 
acid sphingomyelinase phosphodiesterase leads to a build-up of sphingomyelin molecules in 

3 5 lysosomes, resulting in Niemann-Pick disease. 
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Glycosidases catalyze the cleavage of hemiacetyl bonds of glycosides, which are compounds 
that contain one or more sugar. Mammalian lactase-phlorizin hydrolase, for example, is an intestinal 
enzyme that splits lactose. Mammalian beta-galactosidase removes the terminal galactose from 
gangliosides, glycoproteins, and glycosaminoglycans, and deficiency of this enzyme is associated 
5 with a gangliosidosis known as Morquio disease type B. Vertebrate lysosomal alpha-glucosidase, 
which hydrolyzes glycogen, maltose, and isomaltose, and vertebrate intestinal sucrase-isomaltase, 
which hydrolyzes sucrose, maltose, and isomaltose, are widely distributed members of this family 
with highly conserved sequences at their active sites. 

The glyoxylase system is involved in gluconeogenesis, the production of glucose from 

10 storage compounds in the body. It consists of glyoxylase I, which catalyzes the formation of S-D- 
lactoylglutathione from methyglyoxal, a side product of triose-phosphate energy metabolism, and 
glyoxylase II, which hydrolyzes S-D-lactoylglutathione to D-lactic acid and reduced glutathione. 
Glyoxylases are involved in hyperglycemia, non-insulin-dependent diabetes mellitus, the 
detoxification of bacterial toxins, and in the control of cell proliferation and microtubule assembly. 

15 Lyases 

Lyases are a class of enzymes that catalyze the cleavage of C-C, C-O, C-N, C-S, C-(halide), 
P-O or other bonds without hydrolysis or oxidation to form two molecules, at least one of which 
contains a double bond (Stryer, L. (1995) Biochemistry W.H. Freeman and Co. New York, NY 
p.620). Lyases are critical components of cellular biochemistry with roles in metabolic energy 

20 production including fatty acid metabolism, as well as other diverse enzymatic processes. Further 
classification of lyases reflects the type of bond cleaved as well as the nature of the cleaved group. 

The group of C-C lyases include carboxyl-lyases (decarboxylases), aldehyde-lyases 
(aldolases), oxo-acid-lyases and others. The C-O lyase group includes hydro-lyases, lyases acting on 
polysaccharides and other lyases. The C-N lyase group includes ammoni a-1 y ases, amidine-lyases, 

25 amine-lyases (deaminases) and other lyases. 

Proper regulation of lyases is critical to normal physiology. For example, mutation induced 
deficiencies in the uroporphyrinogen decarboxylase can lead to photosensitive cutaneous lesions in 
the genetically-linked disorder familial porphyria cutanea tarda (Mendez, M. et al. (1998) Am. J. 
Genet. 63 : 1 363- 1 375). It has also been shown that adenosine deaminase (ADA) deficiency stems 

3 o from genetic mutations in the ADA gene, resulting in the disorder severe combined 

immunodeficiency disease (SCID) (Hershfield, M.S. (1998) Semin. HematoL 35:291-298). 
Isomerases 

Isomer ases are a class of enzymes that catalyze geometric or structural changes within a 
molecule to form a single product. This class includes racemases and epimerases, cis-trans- 
35 isomerases, intramolecular oxidoreductases, intramolecular transferases (mutases) and intramolecular 
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lyases. Isomerases are critical components of cellular biochemistry with roles in metabolic energy 
production including glycolysis, as well as other diverse enzymatic processes (Stryer, L. (1995) 
Biochemistry . W.H, Freeman and Co., New York NY, pp.483-507). 

Racemases are a subset of isomerases that catalyze inversion of a molecules configuration 
5 around the asymmetric carbon atom in a substrate having a single center of asymmetry, thereby 

interconverting two racemers. Epimerases are another subset of isomerases that catalyze inversion of 
configuration around an asymmetric carbon atom in a substrate with more than one center of 
symmetry, thereby interconverting two epimers. Racemases and epimerases can act on amino acids 
and derivatives, hydroxy acids and derivatives, as well as carbohydrates and derivatives. The 

10 interconversion of UDP-galactose and UDP-glucose is catalyzed by UDP-galactose-4' -epimerase. 
Proper regulation and function of this epimerase is essential to the synthesis of glycoproteins and 
glycolipids. Elevated blood galactose levels have been correlated with UDP-galactose-4' -epimerase 
deficiency in screening programs of infants (Gitzelmann, R. (1972) Helv. Paediat. Acta 27:125-130). 
Oxidoreductases can be isomerases as well. Oxidoreductases catalyze the reversible transfer 

15 of electrons from a substrate that becomes oxidized to a substrate that becomes reduced. This class of 
enzymes includes dehydrogenases, hydroxylases, oxidases, oxygenases, peroxidases, and reductases. 
Proper maintenance of oxidoreductase levels is physiologically important. For example, genetically- 
linked deficiencies in lipoamide dehydrogenase can result in lactic acidosis (Robinson, B.H. et al. 
(1977) Pediat. Res. 11:1198-1202). 

20 Another subgroup of isomerases are the transferases (or mutases). Transferases transfer a 

chemical group from one compound (the donor) to another compound (the acceptor). The types of 
groups transferred by these enzymes include acyl groups, amino groups, phosphate groups 
(phosphotransferases or phosphomutases), and others. The transferase carnitine palmitoyltransferase 
is an important component of fatty acid metabolism. Genetically-linked deficiencies in this 

25 transferase can lead to myopathy (Scriver, C.R. et al. (1995) The Metabolic and Molecular Basis of 
Inherited Disease , McGraw-Hill, New York NY, pp.1501-1533). 

Yet another subgroup of isomerases are the topoisomersases. Topoisomerases are enzymes 
that affect the topological state of DNA. For example, defects in topoisomerases or their regulation 
can affect normal physiology. Reduced levels of topoisomerase II have been correlated with some of 

30 the DNA processing defects associated with the disorder ataxia-telangiectasia (Singh, S.P. et al. 
(1988) Nucleic Acids Res. 16:3919-3929). 
Li gases 

Ligases catalyze the formation of a bond between two substrate molecules. The process 
involves the hydrolysis of a pyrophosphate bond in ATP or a similar energy donor. Ligases are 
3 5 classified based on the nature of the type of bond they form, which can include carbon-oxygen, 
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carbon-sulfur, carbon-nitrogen, carbon-carbon and phosphoric ester bonds. 

Ligases forming carbon-oxygen bonds include the aminoacyl-transfer RNA (tRNA) 
synthetases which are important RNA-associated enzymes with roles in translation. Protein 
biosynthesis depends on each amino acid forming a linkage with the appropriate tRNA. The 
5 aminoacyl-tRNA synthetases are responsible for the activation and correct attachment of an amino 
acid with its cognate tRNA. The 20 aminoacyl-tRNA synthetase enzymes can be divided into two 
structural classes, and.each class is characterized by a distinctive topology of the catalytic domain. 
Class I enzymes contain a catalytic domain based on the nucleotide-binding Rossman fold. Class II 
enzymes contain a central catalytic domain, which consists of a seven-stranded antiparallel B-sheet 

10 motif, as well as N- and C- terminal regulatory domains. Class II enzymes are separated into two 

groups based on the heterodimeric or homodimeric structure of the enzyme; the latter group is further 
subdivided by the structure of the N- and C-terminal regulatory domains (Hartlein, M. and S. Cusack 
(1995) J. Mol. Evol. 40:519-530). Autoantibodies against aminoacyl-tRNAs are generated by 
patients with dermatomyositis and polymyositis, and correlate strongly with complicating interstitial 

15 lung disease (ILD). These antibodies appear to be generated in response to viral infection, and 
coxsackie virus has been used to induce experimental viral myositis in animals. 

Ligases forming carbon-sulfur bonds (Acid-thiol ligases) mediate a large number of cellular 
biosynthetic intermediary metabolism processes involve intermolecular transfer of carbon 
atom-containing substrates (carbon substrates). Examples of such reactions include the tricarboxylic 

20 acid cycle, synthesis of fatty acids and long-chain phospholipids, synthesis of alcohols and aldehydes, 
synthesis of intermediary metabolites, and reactions involved in the amino acid degradation 
pathways. Some of these reactions require input of energy, usually in the form of conversion of ATP 
to either ADP or AMP and pyrophosphate. 

In many cases, a carbon substrate is derived from a small molecule containing at least two 

2 5 carbon atoms. The carbon substrate is often covalently bound to a larger molecule which acts as a 
carbon substrate carrier molecule within the cell. In the biosynthetic mechanisms described above, 
the carrier molecule is coenzyme A. Coenzyme A (CoA) is structurally related to derivatives of the 
nucleotide ADP and consists of 4 , -phosphopantetheine linked via a phosphodiester bond to the alpha 
phosphate group of adenosine S'^'-bisphosphate. The terminal thiol group of 4 , -phosphopantetheine 

30 acts as the site for carbon substrate bond formation. The predominant carbon substrates which utilize 
CoA as a carrier molecule during biosynthesis and intermediary metabolism in the cell are acetyl, 
succinyl, and propionyl moieties, collectively referred to as acyl groups. Other carbon substrates 
include enoyl lipid, which acts as a fatty acid oxidation intermediate, and carnitine, which acts as an 
acetyl-CoA flux regulator/ mitochondrial acyl group transfer protein. Acyl-CoA and acetyl-CoA are 

35 synthesized in the cell by acyl -CoA synthetase and acetyl-CoA synthetase, respectively. 
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Activation of fatty acids is mediated by at least three forms of acyl-CoA synthetase activity: 
i) acetyl-CoA synthetase, which activates acetate and several other low molecular weight carboxylic 
acids and is found in muscle mitochondria and the cytosol of other tissues; ii) medium-chain 
acyl-CoA synthetase, which activates fatty acids containing between four and eleven carbon atoms 
5 (predominantly from dietary sources), and is present only in liver mitochondria; and iii) acyl CoA 
synthetase, which is specific for long chain fatty acids with between six and twenty carbon atoms, and 
is found in microsomes and the mitochondria. Proteins associated with acyl-CoA synthetase activity 
have been identified from many sources including bacteria, yeast, plants, mouse, and man. The 
activity of acyl-CoA synthetase may be modulated by phosphorylation of the enzyme by 

1 o cAMP-dependent protein kinase. 

Ligases forming carbon-nitrogen bonds include amide synthases such as glutamine synthetase 
(glutamate-ammonia ligase) that catalyzes the animation of glutamic acid to glutamine by ammonia 
using the energy of ATP hydrolysis. Glutamine is the primary source for the amino group in various 
amide transfer reactions involved in de novo pyrimidine nucleotide synthesis and in purine and 

15 pyrimidine ribonucleotide interconversions. Overexpression of glutamine synthetase has been 
observed in primary liver cancer (Christa, L. et al. (1994) Gastroent 106:1312-1320). 

Acid-amino-acid ligases (peptide synthases) are represented by the ubiquitin proteases which 
are associated with the ubiquitin conjugation system (UCS), a major pathway for the degradation of 
cellular proteins in eukaryotic cells and some bacteria. The UCS mediates the elimination of 

20 abnormal proteins and regulates the half -lives of important regulatory proteins that control cellular 
processes such as gene transcription and cell cycle progression. In the UCS pathway, proteins 
targeted for degradation are conjugated to a ubiquitin (Ub), a small heat stable protein. Ub is first 
activated by a ubiquitin-activating enzyme (El), and then transferred to one of several Ub- 
conjugating enzymes (E2). E2 then links the Ub molecule through its C-terminal glycine to an 

2 5 internal lysine (acceptor lysine) of a target protein. The ubiquitinated protein is then recognized and 

degraded by proteasome, a large, multisubunit proteolytic enzyme complex, and ubiquitin is released 
for reutilization by ubiquitin protease. The UCS is implicated in the degradation of mitotic cyclic 
kinases, oncoproteins, tumor suppressor genes such as p53, viral proteins, cell surface receptors 
associated with signal transduction, transcriptional regulators, and mutated or damaged proteins 
30 (Ciechanover, A. (1994) Cell 79:13-21). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin 
protease whose overexpression leads to oncogenic transformation of NIH3T3 cells, and the human 
homolog of this gene is consistently elevated in small cell tumors and adenocarcinomas of the lung 
(Gray, D.A. (1995) Oncogene 10:2179-2183). 

Cyclo-ligases and other carbon-nitrogen ligases comprise various enzymes and enzyme 

3 5 complexes that participate in the de novo pathways to purine and pyrimidine biosynthesis. Because 
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these pathways are critical to the synthesis of nucleotides for replication of both RN A and DN A, 
many of these enzymes have been the targets of clinical agents for the treatment of cell proliferative 
disorders such as cancer and infectious diseases. 

Purine biosynthesis occurs de novo from the amino acids glycine and glutamine, and other 
5 small molecules. Three of the key reactions in this process are catalyzed by a trifunctional enzyme 
composed of glycinamide-ribonucleotide synthetase (GARS), aminoimidazole ribonucleotide 
synthetase (AIRS), and glycinamide ribonucleotide transformylase (GART). Together these three 
enzymes combine ribosylamine phosphate with glycine to yield phosphoribosyl aminoimidazole, a 
precursor to both adenylate and guanylate nucleotides. This trifunctional protein has been implicated 

10 in the pathology of Downs syndrome (Aimi, J. et al. (1990) Nucleic Acid Res. 18:6665-6672). 

Adenylosuccinate synthetase catalyzes a later step in purine biosynthesis that converts inosinic acid to 
adenylosuccinate, a key step on the path to ATP synthesis. This enzyme is also similar to another 
carbon-nitrogen ligase, argininosuccinate synthetase, that catalyzes a similar reaction in the urea cycle 
(Powell, S.M. et al. (1992) FEBS Lett. 303:4-10). 

is Like the de novo biosynthesis of purines, de novo synthesis of the pyrimidine nucleotides 

uridylate and cytidylate also arises from a common precursor, in this instance the nucleotide 
orotidylate derived from orotate and phosphoribosyl pyrophosphate (PPRP). Again a trifunctional 
enzyme comprising three carbon-nitrogen ligases plays a key role in the process. In this case the 
enzymes aspartate transcarbamylase (ATCase), carbamyl phosphate synthetase II, and dihydroorotase 

2 o (DHOase) are encoded by a single gene called CAD. Together these three enzymes combine the 

initial reactants in pyrimidine biosynthesis, glutamine, CO^ and ATP to form dihydroorotate, the 
precursor to orotate and orotidylate (Iwahana, H. et al. (1996) Biochem. Biophys. Res. Commun. 
219:249-255). Further steps then lead to the synthesis of uridine nucleotides from orotidylate. 
Cytidine nucleotides are derived from uridine-S'-triphosphate (UTP) by the amidation of UTP using 
25 glutamine as the amino donor and the enzyme CTP synthetase. Regulatory mutations in the human 
CTP synthetase are believed to confer multi-drug resistance to agents widely used in cancer therapy 
(Yamauchi, M. et al. (1990) EMBO J. 9:2095-2099). 

Ligases forming carbon-carbon bonds include the carboxylases acetyl-CoA carboxylase and 
pyruvate carboxylase. Acetyl-CoA carboxylase catalyzes the carboxylation of acetyl-CoA from C0 2 

3 o and H 2 0 using the energy of ATP hydrolysis. Acetyl-CoA carboxylase is the rate-limiting step in the 

biogenesis of long-chain fatty acids. Two isoforms of acetyl-CoA carboxylase, types I and types II, 
are expressed in human in a tissue-specific manner (Ha, J. et al. (1994) Eur. J. Biochem. 219:297- 
306). Pyruvate carboxylase is a nuclear-encoded mitochondrial enzyme that catalyzes the conversion 
of pyruvate to oxaloacetate, a key intermediate in the citric acid cycle. 
3 5 Ligases forming phosphoric ester bonds include the DNA ligases involved in both DNA 
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replication and repair. DNA ligases seal phosphodiester bonds between two adjacent nucleotides in a 
DNA chain using the energy from ATP hydrolysis to first activate the free 5' -phosphate of one 
nucleotide and then react it with the 3' -OH group of the adjacent nucleotide. This resealing reaction 
is used in both DNA replication to join small DNA fragments called Okazaki fragments that are 
5 transiently formed in the process of replicating new DNA, and in DNA repair. DNA repair is the 
process by which accidental base changes, such as those produced by oxidative damage, hydrolytic 
attack, or uncontrolled methylation of DNA, are corrected before replication or transcription of the 
DNA can occur. Bloom's syndrome is an inherited human disease in which individuals are partially 
deficient in DNA ligation and consequently have an increased incidence of cancer (Alberts, B. et al. 
10 (1994) The Molecular Biology of the Cell , Garland Publishing Inc., New York NY, p. 247). 

Molecules Associated with Growth and Development 

SEQ ID NO:69, SEQ ID NO:70, and SEQ ID NO.71 encode, for example, molecules 
associated with growth and development. 

1 5 Human growth and development requires the spatial and temporal regulation of cell 

differentiation, cell proliferation, and apoptosis. These processes coordinately control reproduction, 
aging, embryogenesis, morphogenesis, organogenesis, and tissue repair and maintenance. At the 
cellular level, growth and development is governed by the cell's decision to enter into or exit from the 
cell division cycle and by the cell's commitment to a terminally differentiated state. These decisions 

20 are made by the cell in response to extracellular signals and other environmental cues it receives. The 
following discussion focuses on the molecular mechanisms of cell division, reproduction, cell 
differentiation and proliferation, apoptosis, and aging. 
Cell Division 

Cell division is the fundamental process by which all living things grow and reproduce. In 
2 5 unicellular organisms such as yeast and bacteria, each cell division doubles the number of organisms, 
while in multicellular species many rounds of cell division are required to replace cells lost by wear or 
by programmed cell death, and for cell differentiation to produce a new tissue or organ. Details of the 
cell division cycle may vary, but the basic process consists of three principle events. The first event, 
interphase, involves preparations for cell division, replication of the DNA, and production of essential 
30 proteins. In the second event, mitosis, the nuclear material is divided and separates to opposite sides of 
the cell. The final event, cytokinesis, is division and fission of the cell cytoplasm. The sequence and 
timing of cell cycle transitions is under the control of the cell cycle regulation system which controls the 
process by positive or negative regulatory circuits at various check points. 

Regulated progression of the cell cycle depends on the integration of growth control pathways 
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with the basic cell cycle machinery. Cell cycle regulators have been identified by selecting for human 
and yeast cDN As that block or activate cell cycle arrest signals in the yeast mating pheromone pathway 
when they are overexpressed. Known regulators include human CPR (cell cycle progression 
restoration) genes, such as CPR8 and CPR2, and yeast CDC (cell division control) genes, including 
5 CDC91, that block the arrest signals. The CPR genes express a variety of proteins including cyclins, 
tumor suppressor binding proteins, chaperones, transcription factors, translation factors, and 
RNA-binding proteins (Edwards, M.C. et al.(1997) Genetics 147:1063-1076). 

Several cell cycle transitions, including the entry and exit of a cell from mitosis, are dependent 
upon the activation and inhibition of cyclin-dependent kinases (Cdks). The Cdks are composed of a 

10 kinase subunit, Cdk, and an activating subunit, cyclin, in a complex that is subject to many levels of 
regulatioa There appears to be a single Cdk in Saccharomvces cerevisiae and Saccharomvces pombe 
whereas mammals have a variety of specialized Cdks. Cyclins act by binding to and activating 
cyclin-dependent protein kinases which then phosphorylate and activate selected proteins involved in the 
mitotic process. The Cdk-cyclin complex is both positively and negatively regulated by 

15 phosphorylation, and by targeted degradation involving molecules such as CDC4 and CDC53. In 
addition, Cdks are further regulated by binding to inhibitors and other proteins such as Sucl that 
modify their specificity or accessibility to regulators (Patra, D. and W.G. Dunphy (1996) Genes Dev. 
10:1503-1515; and Mathias, N. et al. (1996) Mol. Cell Biol. 16:6634-6643). 
Reproduction 

2 o The male and female reproductive systems are complex and involve many aspects of growth 

and development. The anatomy and physiology of the male and female reproductive systems are 
reviewed in (Guyton, A.C. (1991) Textbook of Medical Physiology . W.B. Saunders Co., Philadelphia 
PA, pp. 899-928). 

The male reproductive system includes the process of spermatogenesis, in which the sperm are 

2 5 formed, and male reproductive functions are regulated by various hormones and their effects on 

accessory sexual organs, cellular metabolism, growth, and other bodily functions. 

Spermatogenesis begins at puberty as a result of stimulation by gonadotropic hormones 
released from the anterior pituitary. Immature sperm (spermatogonia) undergo several mitotic cell 
divisions before undergoing meiosis and full maturation. The testes secrete several male sex hormones, 

3 o the most abundant being testosterone, that is essential for growth and division of the immature sperm, 

and for the masculine characteristics of the male body. Three other male sex hormones, gonadotropin- 
releasing hormone (GnRH), luteinizing hormone (LH), and follicle-stimulating hormone (FSH) control 
sexual function. 

The uterus, ovaries, fallopian tubes, vagina, and breasts comprise the female reproductive 
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system. The ovaries and uterus are the source of ova and the location of fetal development, 
respectively. The fallopian tubes and vagina are accessory organs attached to the top and bottom of the 
uterus, respectively. Both the uterus and ovaries have additional roles in the development and loss of 
reproductive capability during a female's lifetime. The primary role of the breasts is lactation. 
5 Multiple endocrine signals from the ovaries, uterus, pituitary, hypothalamus, adrenal glands, and other 
tissues coordinate reproduction and lactation. These signals vary during the monthly menstruation 
cycle and during the female's lifetime. Similarly, the sensitivity of reproductive organs to these 
endocrine signals varies during the female's lifetime. 

A combination of positive and negative feedback to the ovaries, pituitary and hypothalamus 

10 glands controls physiologic changes during the monthly ovulation and endometrial cycles. The anterior 
pituitary secretes two major gonadotropin hormones, follicle-stimulating hormone (FSH) and luteinizing 
hormone (LH), regulated by negative feedback of steroids, most notably by ovarian estradiol. If 
fertilization does not occur, estrogen and progesterone levels decrease. This sudden reduction of the 
ovarian hormones leads to menstruation, the desquamation of the endometrium. 

15 Hormones further govern all the steps of pregnancy, parturition, lactation, and menopause. 

During pregnancy large quantities of human chorionic gonadotropin (hCG), estrogens, progesterone, 
and human chorionic somatomammotropin (hCS) are formed by the placenta. hCG, a glycoprotein 
similar to luteinizing hormone, stimulates the corpus luteum to continue producing more progesterone 
and estrogens, rather than to involute as occurs if the ovum is not fertilized. hCS is similar to growth 

2 o hormone and is crucial for fetal nutrition. 

The female breast also matures during pregnancy. Large amounts of estrogen secreted by the 
placenta trigger growth and branching of the breast milk ductal system while lactation is initiated by the 
secretion of prolactin by the pituitary gland. 

Parturition involves several hormonal changes that increase uterine contractility toward the end 

25 of pregnancy, as follows. The levels of estrogens increase more than those of progesterone. Oxytocin 
is secreted by the neurohypophysis. Concomitantly, uterine sensitivity to oxytocin increases. The fetus 
itself secretes oxytocin, Cortisol (from adrenal glands), and prostaglandins. 

Menopause occurs when most of the ovarian follicles have degenerated The ovary then 
produces less estradiol, reducing the negative feedback on the pituitary and hypothalamus glands. 

30 Mean levels of circulating FSH and LH increase, even as ovulatory cycles continue. Therefore, the 

ovary is less responsive to gonadotropins, and there is an increase in the time between menstrual cycles. 
Consequently, menstrual bleeding ceases and reproductive capability ends. 
Cell Differentiation and Proliferation 

Tissue growth involves complex and ordered patterns of cell proliferation, cell differentiation, 
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and apoptosis. Cell proliferation must be regulated to maintain both the number of cells and their 
spatial organization. This regulation depends upon the appropriate expression of proteins which control 
cell cycle progression in response to extracellular signals, such as growth factors and other mitogens, 
and intracellular cues, such as DNA damage or nutrient starvation. Molecules which directly or 
5 indirectly modulate cell cycle progression fall into several categories, including growth factors and their 
receptors, second messenger and signal transduction proteins, oncogene products, tumor-suppressor 
proteins, and mitosis-promoting factors. 

Growth factors ware originally described as serum factors required to promote cell 
proliferation. Most growth factors are large, secreted polypeptides that act on cells in their local 

10 environment Growth factors bind to and activate specific cell surface receptors and initiate 

intracellular signal transduction cascades. Many growth factor receptors are classified as receptor 
tyrosine kinases which undergo autophosphorylation upon ligand binding. Autophosphorylation 
enables the receptor to interact with signal transduction proteins characterized by the presence of SH2 
or SH3 domains (Src homology regions 2 or 3). These proteins then modulate the activity state of 

15 small G-proteins, such as Ras, Rab, and Rho, along with GTPase activating proteins (GAPs), guanine 
nucleotide releasing proteins (GNRPs), and other guanine nucleotide exchange factors. Small G 
proteins act as molecular switches that activate other downstream events, such as mitogen-activated 
protein kinase (MAP kinase) cascades. MAP kinases ultimately activate transcription of mitosis- 
promoting genes. 

20 In addition to growth factors, small signaling peptides and hormones also influence cell 

proliferation. These molecules bind primarily to another class of receptor, the trimeric G-protein 
coupled receptor (GPCR), found predominantly on the surface of immune, neuronal and neuroendocrine 
cells. Upon ligand binding, the GPCR activates a trimeric G protein which in turn triggers increased 
levels of intracellular second messengers such as phospholipase C, Ca2+, and cyclic AMP. Most 

2 5 GPCR-mediated signaling pathways indirectly promote cell proliferation by causing the secretion or 
breakdown of other signaling molecules that have direct autogenic effects. These signaling cascades 
often involve activation of kinases and phosphatases. Some growth factors, such as some members of 
the transforming growth factor beta (TGF-p) family, act on some cells to stimulate cell proliferation 
and on other cells to inhibit it. Growth factors may also stimulate a cell at one concentration and inhibit 

30 the same cell at another concentration. Most growth factors also have a multitude of other actions 
besides the regulation of cell growth and division: they can control the proliferation, survival, 
differentiation, migration, or function of cells depending on the circumstance. For example, the tumor 
necrosis factor/nerve growth factor (TNF/NGF) family can activate or inhibit cell death, as well as 
regulate proliferation and differentiation. The cell response depends on the type of cell, its stage of 
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differentiation and transformation status, which surface receptors are stimulated, and the types of 
stimuli acting on the cell (Smith, A. et al. (1994) Cell 76:959-962; and Nocentini, G. et al. (1997) Proc. 
Natl. Acad. Sci. USA 94:6216-6221). 

Neighboring cells in a tissue compete for growth factors, and when provided with "unlimited" 
5 quantities in a pa-fused system will grow to even higher cell densities before reaching density-dependent 
inhibition of cell division. Cells often demonstrate an anchorage dependence of cell division as well. 
This anchorage dependence may be associated with the formation of focal contacts linking the 
cytoskeleton with the extracellular matrix (ECM). The expression of ECM components can be 
stimulated by growth factors. For example, TGF-p stimulates fibroblasts to produce a variety of ECM 

10 proteins, including fibronectin, collagen, and tenascin (Pearson, C.A. et al. (1988) EMBO J. 7:2677- 
2981). In fact, for some cell types specific ECM molecules, such as laminin or fibronectin, may act as 
growth factors. Tenascin-C and -R, expressed in developing and lesioned neural tissue, provide 
stimulatory/anti-adhesive or inhibitory properties, respectively, for axonal growth (Faissner, A. (1997) 
Cell Tissue Res. 290:331-341). 

15 Cancers are associated with the activation of oncogenes which are derived from normal cellular 

genes. These oncogenes encode oncoproteins which convert normal cells into malignant cells. Some 
oncoproteins are mutant isoforms of the normal protein, and other oncoproteins are abnormally 
expressed with respect to location or amount of expression. The latter category of oncoprotein causes 
cancer by altering transcriptional control of cell proliferation. Five classes of oncoproteins are known 

20 to affect cell cycle controls. These classes include growth factors, growth factor receptors, intracellular 
signal transducers, nuclear transcription factors, and cell-cycle control proteins. Viral oncogenes are 
integrated into the human genome after infection of human cells by certain viruses. Examples of viral 
oncogenes include v-src, v-abl, and v-fps. 

Many oncogenes have been identified and characterized. These include sis, erbA, erbB, her-2, 

25 mutated G s> src, abl, ras, crk, jun, fos, myc, and mutated tumor-suppressor genes such as RB, p53, 
mdm2, Cipl , pi 6, and cyclin D. Transformation of normal genes to oncogenes may also occur by 
chromosomal translocation. The Philadelphia chromosome, characteristic of chronic myeloid leukemia 
and a subset of acute lymphoblastic leukemias, results from a reciprocal translocation between 
chromosomes 9 and 22 that moves a truncated portion of the proto-oncogene c-abl to the breakpoint 

3 o cluster region (bcr) on chromosome 22. 

Tumor-suppressor genes are involved in regulating cell proliferation. Mutations which cause 
reduced or loss of function in tumor-suppressor genes result in uncontrolled cell proliferation. For 
example, the retinoblastoma gene product (RB), in a non-phosphorylated state, binds several early- 
response genes and suppresses their transcription, thus blocking cell division. Phosphorylation of RB 

20 
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causes it to dissociate from the genes, releasing the suppression, and allowing cell division to proceed. 
Apoptosis 

Apoptosis is the genetically controlled process by which unneeded or defective cells undergo 
programmed cell death. Selective elimination of cells is as important for morphogenesis and tissue 
5 remodeling as is cell proliferation and differentiation. Lack of apoptosis may result in hyperplasia and 
other disorders associated with increased cell proliferation. Apoptosis is also a critical component of 
the immune response. Immune cells such as cytotoxic T-cells and natural killer cells prevent the spread 
of disease by inducing apoptosis in tumor cells and virus-infected cells. In addition, immune cells that 
fail to distinguish self molecules from foreign molecules must be eliminated by apoptosis to avoid an 

1 o autoimmune response. 

Apoptotic cells undergo distinct morphological changes. Hallmarks of apoptosis include cell 
shrinkage, nuclear and cytoplasmic condensation, and alterations in plasma membrane topology. 
Biochemically, apoptotic cells are characterized by increased intracellular calcium concentration, 
fragmentation of chromosomal DNA, and expression of novel cell surface components. 

1 5 The molecular mechanisms of apoptosis are highly conserved, and many of the key protein 

regulators and effectors of apoptosis have been identified. Apoptosis generally proceeds in response to 
a signal which is transduced intracellular^ and results in altered patterns of gene expression and protein 
activity. Signaling molecules such as hormones and cytokines are known both to stimulate and to 
inhibit apoptosis through interactions with cell surface receptors. Transcription factors also play an 

20 important role in the onset of apoptosis. A number of downstream effector molecules, particularly 
proteases such as the cysteine proteases called caspases, have been implicated in the degradation of 
cellular components and the proteolytic activation of other apoptotic effectors. 
Aging and Senescence 

Studies of the aging process or senescence have shown a number of characteristic cellular and 
25 molecular changes (Fauci et al. (1998) Harrison's Principles of Internal Medicine . McGraw-Hill, New 
York NY, p.37). These characteristics include increases in chromosome structural abnormalities, DNA 
cross-linking, incidence of single-stranded breaks in DNA, losses in DNA methylation, and degradation 
of telomere regions. In addition to these DNA changes, post-translational alterations of proteins 
increase including, deamidation, oxidation, cross-linking, and nonenzymatic glycation. Still further 
3 o molecular changes occur in the mitochondria of aging cells through deterioration of structure. These 
changes eventually contribute to decreased function in every organ of the body. 

Biochemical Pathway Molecules 

SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, and SEQ ID NO:68 
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encode, for example, biochemical pathway molecules. 

Biochemical pathways are responsible for regulating metabolism, growth and development, 
protein secretion and trafficking, environmental responses, and ecological interactions including 
immune response and response to parasites. 
5 DNA replication 

Deoxyribonucleic acid (DNA), the genetic material, is found in both the nucleus and 
mitochondria of human cells. The bulk of human DNA is nuclear, in the form of linear chromosomes, 
while mitochondrial DNA is circular. DNA replication begins at specific sites called origins of 
replication. Bidirectional synthesis occurs from the origin via two growing forks that move in opposite 

10 directions. Replication is semi-conservative, with each daughter duplex containing one old strand and 
its newly synthesized complementary partner. Proteins involved in DNA replication include DNA 
polymerases, DNA primase, telomerase, DNA helicase, topoisomerases, DNA ligases, replication 
factors, and DNA-binding proteins. 
DNA Recombination and Repair 

15 Cells are constantly faced with replication errors and environmental assault (such as ultraviolet 

irradiation) that can produce DNA damage. Damage to DNA consists of any change that modifies the 
structure of the molecule. Changes to DNA can be divided into two general classes, single base 
changes and structural distortions. Any damage to DNA can produce a mutation, and the mutation may 
produce a disorder, such as cancer. 

2 o Changes in DNA are recognized by repair systems within the cell. These repair systems act to 

correct the damage and thus prevent any deleterious affects of a mutational event. Repair systems can 
be divided into three general types, direct repair, excision repair, and retrieval systems. Proteins 
involved in DNA repair include DNA polymerase, excision repair proteins, excision and cross link 
repair proteins, recombination and repair proteins, R AD5 1 proteins, and BLN and WRN proteins that 

25 are homologs of RecQ helicase. When the repair systems are eliminated, cells become exceedingly 

sensitive to environmental mutagens, such as ultraviolet irradiation. Patients with disorders associated 
with a loss in DNA repair systems often exhibit a high sensitivity to environmental mutagens. 
Examples of such disorders include xeroderma pigmentosum (XP), Bloom's syndrome (BS), and 
Werner's syndrome (WS) (Yamagata, IC et al. (1998) Proc. Natl. Acad. Sci. USA 95:8733-8738), 

30 ataxia telangiectasia, Cockayne's syndrome, and Fanconi's anemia. 

Recombination is the process whereby new DNA sequences are generated by the movements of 
large pieces of DNA. In homologous recombination, which occurs during meiosis and DNA repair, 
parent DNA duplexes align at regions of sequence similarity, and new DNA molecules form by the 
breakage and joining of homologous segments. Proteins involved include RAD5 1 recombinase. In site- 



22 



WO 01/21836 



PCT/US00/25643 



specific recombination, two specific but not necessarily homologous DNA sequences are exchanged. In 
the immune system this process generates a diverse collection of antibody and T cell receptor genes. 
Proteins involved in site-specific recombination in the immune system include recombination activating 
genes 1 and 2 (RAG1 and RAG2). A defect in immune system site-specific recombination causes 
5 severe combined immunodeficiency disease in mice. 
RNA Metabolism 

Ribonucleic acid (RNA) is a linear single-stranded polymer of four nucleotides, ATP, CTP, 
UTP, and GTP. In most organisms, RNA is transcribed as a copy of DNA, the genetic material of the 
organism. In retroviruses RNA rather than DNA serves as the genetic material. RNA copies of the 

10 genetic material encode proteins or serve various structural, catalytic, or regulatory roles in organisms. 
RNA is classified according to its cellular localization and function. Messenger RNAs (mRNAs) 
encode polypeptides. Ribosomal RNAs (rRNAs) are assembled, along with ribosomal proteins, into 
ribosomes, which are cytoplasmic particles that translate mRNA into polypeptides. Transfer RNAs 
(tRNAs) are cytosolic adaptor molecules that function in mRNA translation by recognizing both an 

1 5 mRNA codon and the amino acid that matches that codon. Heterogeneous nuclear RNAs (hnRN As) 
include mRNA precursors and other nuclear RNAs of various sizes. Small nuclear RNAs (snRNAs) 
are a part of the nuclear spliceosome complex that removes intervening, non-coding sequences (introns) 
and rejoins exons in pre-mRN As. 
RNA Transcription 

2 0 The transcription process synthesizes an RNA copy of DNA. Proteins involved include multi- 

subunit RNA polymerases, transcription factors IIA, IIB, IID, IIE, IIF, IIH, and IIJ. Many 
transcription factors incorporate DNA-binding structural motifs which comprise either ce -helices or P- 
sheets that bind to the major groove of DNA. Four well-characterized structural motifs are helix-turn- 
helix, zinc finger, leucine zipper, and helix-loop-helix. 
25 RNA Processing 

Various proteins are necessary for processing of transcribed RNAs in the nucleus. Pre-mRNA 
processing steps include capping at the 5* end with methylguanosine, polyadenylating the 3' end, and 
splicing to remove introns. The spliceosomal complex is comprised of five small nuclear 
ribonucleoprotein particles (snRNPs) designated Ul, U2, U4, U5, and U6. Each snRNP contains a 

3 o single species of snRNA and about ten proteins. The RNA components of some snRNPs recognize and 

base-pair with intron consensus sequences. The protein components mediate spliceosome assembly and 
the splicing reaction. Autoantibodies to snRNP proteins are found in the blood of patients with 
systemic lupus erythematosus (Stryer, L. (1995) Biochemistry W.H. Freeman and Company, New 
York NY, p. 863). 
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Heterogeneous nuclear ribonucleoproteins (hnRNPs) have been identified that have roles in 
splicing, exporting of the mature RNAs to the cytoplasm, and mRNA translation (Biamonti, G. et al. 
(1998) Clin. Exp. Rheumatol. 16:317-326). Some examples of hnRNPs include the yeast proteins 
Hrplp, involved in cleavage and polyadenylation at the 3' end of the RNA; Cbp80p, involved in 
5 capping the 5* end of the RNA; and Npl3p, a homolog of mammalian hnRNP Al, involved in export of 
mRNA from the nucleus (Shen, B.C. et al. (1998) Genes Dev. 12:679-691). HnRNPs have been shown 
to be important targets of the autoimmune response in rheumatic diseases (Biamonti, supra) . 

Many snRNP proteins, hnRNP proteins, and alternative splicing factors are characterized by 
an RNA recognition motif (RRM). (Reviewed in Birney, E. et al. (1993) Nucleic Acids Res. 21 :5803- 

10 5816.) The RRM is about 80 amino acids in length and forms four p-strands and two a-helices 
arranged in an ct/p sandwich. The RRM contains a core RNP-1 octapeptide motif along with 
surrounding conserved sequences. 
RNA Stability and Degradation 

RNA helicases alter and regulate RNA conformation and secondary structure by using energy 

15 derived from ATP hydrolysis to destabilize and unwind RNA duplexes. The most well-characterized 
and ubiquitous family of RNA helicases is the DEAD-box family, so named for the conserved B-type 
ATP-binding motif which is diagnostic of proteins in this family. Over 40 DEAD-box helicases have 
been identified in organisms as diverse as bacteria, insects, yeast, amphibians, mammals, and plants. 
DEAD-box helicases function in diverse processes such as translation initiation, splicing, ribosome 

20 assembly, and RNA editing, transport, and stability. Some DEAD-box helicases play tissue- and stage- 
specific roles in spermatogenesis and embryogenesis. (Reviewed in Linda*, P. et al. (1989) Nature 
337:121-122.) 

Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the progression of 
neuroblastoma (Nb) and retinoblastoma (Rb) tumors. Other DEAD-box helicases have been implicated 

2 5 either directly or indirectly in ultraviolet light-induced tumors, B cell lymphoma, and myeloid 
malignancies. (Reviewed in Godbout, R. et al. (1998) J. Biol. Chem. 273:21 161-21 168.) 

Ribonucleases (RNases) catalyze the hydrolysis of phosphodi ester bonds in RNA chains, thus 
cleaving the RNA. For example, RNase P is a ribonucleoprotein enzyme which cleaves the 5* end of 
pre-tRNAs as part of their maturation process. RNase H digests the RNA strand of an RNA/DNA 

30 hybrid. Such hybrids occur in cells invaded by retroviruses, and RNase H is an important enzyme in 
the retroviral replication cycle. RNase H domains are often found as a domain associated with reverse 
transcriptases. RNase activity in serum and cell extracts is elevated in a variety of cancers and 
infectious diseases (Schein, C.H. (1997) Nat. Biotechnol. 15:529-536). Regulation of RNase activity is 
being investigated as a means to control tumor angiogenesis, allergic reactions, viral infection and 
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replication, and fungal infections. 
Protein Translation 

The eukaryotic ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, 
which together form the 80S ribosome. In addition to the 18S, 28S, 5S, and 5.8S rRNAs, the ribosome 
5 also contains more than fifty proteins. The ribosomal proteins have a prefix which denotes the subunit 
to which they belong, either L (large) or S (small). Three important sites are identified on the ribosome. 
The aminoacyl-tRNA site (A site) is where charged tRNAs (with the exception of the initiator-tRNA) 
bind on arrival at the ribosome. The peptidyl-tRNA site (P site) is where new peptide bonds are 
formed, as well as where the initiator tRNA binds. The exit site (E site) is where deacylated tRNAs 
10 bind prior to their release from the ribosome. (Translation is reviewed in Stryer, L. (1995) 

Biochemistry , W.H. Freeman and Company, New York NY, pp. 875-908; and Lodish, H. et al. (1995) 
Molecular Cell Biology , Scientific American Books, New York NY, pp. 119-138.) 
tRNA Charging 

Protein biosynthesis depends on each amino acid forming a linkage with the appropriate tRNA. 

15 The aminoacyl-tRNA synthetases are responsible for the activation and correct attachment of an amino 
acid with its cognate tRNA The 20 aminoacyl-tRNA synthetase enzymes can be divided into two 
structural classes, Class I and Class II. Autoantibodies against aminoacyl-tRNAs are generated by 
patients with dermatomyositis and polymyositis, and correlate strongly with complicating interstitial 
lung disease (ELD). These antibodies appear to be generated in response to viral infection, and 

20 coxsackie virus has been used to induce experimental viral myositis in animals. 
Translation Initiation 

Initiation of translation can be divided into three stages. The first stage brings an initiator 
transfer RNA (Met-tRNAf) together with the 40S ribosomal subunit to form the 43S preinitiation 
complex. The second stage binds the 43S preinitiation complex to the mRNA, followed by migration of 
2 5 the complex to the correct AUG initiation codoit The third stage brings the 60S ribosomal subunit to 
the 40S subunit to generate an 80S ribosome at the initiation codon. Regulation of translation primarily 
involves the first and second stage in the initiation process (Pain, V.M. (1996) Eur. J. Biochem. 
236:747-771). 

Several initiation factors, many of which contain multiple subunits, are involved in bringing an 
30 initiator tRNA and 40S ribosomal subunit together. eIF2, a guanine nucleotide binding protein, recruits 
the initiator tRNA to the 40S ribosomal subunit. Only when eIF2 is bound to GTP does it associate 
with the initiator tRNA eIF2B, a guanine nucleotide exchange protein, is responsible for converting 
eIF2 from the GDP-bound inactive form to the GTP-bound active form. Two other factors, eEFl A and 
eIF3 bind and stabilize the 40S subunit by interacting with 1 8S ribosomal RNA and specific ribosomal 
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structural proteins. eIF3 is also involved in association of the 40S ribosomal subunit with mRNA. The 
Met-tRNA f , elFl A, eIF3, and 40S ribosomal subunit together make up the 43S preinitiation complex 
(Pain, su praV 

Additional factors are required for binding of the 43S preinitiation complex to an mRNA 
5 molecule, and the process is regulated at several levels. eIF4F is a complex consisting of three proteins: 
eIF4E, eIF4A, and eIF4G. eIF4E recognizes and binds to the mRNA 5 -terminal m 7 GTP cap, eDF4A is 
a bidirectional RNA-dependent helicase, and eIF4G is a scaffolding polypeptide. eIF4G has three 
binding domains. The N-terminal third of eBF4G interacts with eBF4E, the central third interacts with 
eIF4A, and the C -terminal third interacts with eIF3 bound to the 43S preinitiation complex. Thus, 
10 eEF4G acts as a bridge between the 40S ribosomal subunit and the mRNA (Hentze, M.W. (1997) 
Science 275:500-501). 

The ability of eIF4F to initiate binding of the 43S preinitiation complex is regulated by 
structural features of the mRNA. The mRNA molecule has an untranslated region (UTR) between the 
5' cap and the AUG start codon. In some mRNAs this region forms secondary structures that impede 
1 5 binding of the 43S preinitiation complex. The helicase activity of eIF4A is thought to function in 

removing this secondary structure to facilitate binding of the 43S preinitiation complex (Pain, supra) . 
Translation Elongation 

Elongation is the process whereby additional amino acids are joined to the initiator methionine 
to form the complete polypeptide chain. The elongation factors EFla, EFlp y, and EF2 are involved 

20 in elongating the polypeptide chain following initiation. EFla is a GTP-binding protein. InEFla's 
GTP-bound form, it brings an aminoacyl-tRNA to the ribosome* s A site. The amino acid attached to 
the newly arrived aminoacyl-tRNA forms a peptide bond with the initiator methionine. The GTP on 
EFla is hydrolyzed to GDP, and EFla-GDP dissociates from the ribosome. EFip y binds EFla -GDP 
and induces the dissociation of GDP from EFla, allowing EFla to bind GTP and a new cycle to begin. 

25 As subsequent aminoacyl-tRNAs are brought to the ribosome, EF-G, another GTP-binding 

protein, catalyzes the translocation of tRNAs from the A site to the P site and finally to the E site of the 
ribosome. This allows the processivity of translation. 
Translation Termination 

The release factor eRF carries out termination of translation. eRF recognizes stop codons in 

30 the mRNA, leading to the release of the polypeptide chain from the ribosome. 
Post-Translational Pathways 

Proteins may be modified after translation by the addition of phosphate, sugar, prenyl, fatty 
acid, and other chemical groups. These modifications are often required for proper protein activity. 
Enzymes involved in post-translational modification include kinases, phosphatases, 
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glycosyltransferases, and prenyltransferases. The conformation of proteins may also be modified after 
translation by the introduction and rearrangement of disulfide bonds (rearrangement catalyzed by 
protein disulfide isomerase), the isomerization of proline sidechains by prolyl isomerase, and by 
interactions with molecular chaperone proteins. 
5 Proteins may also be cleaved by proteases. Such cleavage may result in activation, 

inactivation, or complete degradation of the protein. Proteases include serine proteases, cysteine 
proteases, aspartic proteases, and metalloproteases. Signal peptidase in the endoplasmic reticulum 
(ER) lumen cleaves the signal peptide from membrane or secretory proteins that are imported into the 
ER. Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major 

10 pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria. The UCS 
mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory 
proteins that control cellular processes such as gene transcription and cell cycle progression. In the 
UCS pathway, proteins targeted for degradation are conjugated to a ubiquitin, a small heat stable 
protein. Proteins involved in the UCS include ubiquitin-activating enzyme, ubiquitin-conjugating 

is enzymes, ubiquitin-ligases, and ubiquitin C-terminal hydrolases. The ubiquitinated protein is then 
recognized and degraded by the proteasome, a large, multisubunit proteolytic enzyme complex, and 
ubiquitin is released for reutilization by ubiquitin protease. 
Lipid Metabolism 

Lipids are water-insoluble, oily or greasy substances that are soluble in nonpolar solvents such 
20 as chloroform or ether. Neutral fats (triacylglycerols) serve as major fuels and energy stores. Polar 
lipids, such as phospholipids, sphingolipids, glycolipids, and cholesterol, are key structural components 
of cell membranes. 

Lipid metabolism is involved in human diseases and disorders. In the arterial disease 
atherosclerosis, fatty lesions form on the inside of the arterial wall. These lesions promote the loss of 

2 5 arterial flexibility and the formation of blood clots (Guyton, AC. Textbook of Medical Physiology 

(1991) W.B. Saunders Company, Philadelphia PA, pp.760-763). In Tay-Sachs disease, the GM 2 
ganglioside (a sphingolipid) accumulates in lysosomes of the central nervous system due to a lack of the 
enzyme N-acetylhexosaminidase. Patients suffer nervous system degeneration leading to early death 
(Fauci, AS. et al. (1998) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY, p. 

3 o 2171). The Niemann-Pick diseases are caused by defects in lipid metabolism. Niemann-Pick diseases 

types A and B are caused by accumulation of sphingomyelin (a sphingolipid) and other lipids in the 
central nervous system due to a defect in the enzyme sphingomyelinase, leading to neurodegeneration 
and lung disease. Niemann-Pick disease type C results from a defect in cholesterol transport, leading to 
the accumulation of sphingomyelin and cholesterol in lysosomes and a secondary reduction in 
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sphingomyelinase activity. Neurological symptoms such as grand mal seizures, ataxia, and loss of 
previously learned speech, manifest 1-2 years after birth. A mutation in the NPC protein, which 
contains a putative cholesterol-sensing domain, was found in a mouse model of Niemann-Pick disease 
type C (Fauci, supra , p. 2175; Loftus, S.KL et al. (1997) Science 277:232-235). (Lipid metabolism is 
5 reviewed in Stryer, L. (1995) Biochemistry . W.H. Freeman and Company, New York NY; Lehninger, 
A. (1982) Principles of Biochemistry Worth Publishers, Inc., New York NY; and ExPASy 
"Biochemical Pathways" index of Boehringer Mannheim World Wide Web site.) 
Fatty Acid Synthesis 

Fatty acids are long-chain organic acids with a single carboxyl group and a long non-polar 

10 hydrocarbon tail. Long-chain fatty acids are essential components of glycolipids, phospholipids, and 
cholesterol, which are building blocks for biological membranes, and of triglycerides, which are 
biological fuel molecules. Long-chain fatty acids are also substrates for eicosanoid production, and are 
important in the functional modification of certain complex carbohydrates and proteins. 1 6-carbon and 
18-carbon fatty acids are the most common. 

15 Fatty acid synthesis occurs in the cytoplasm. In the first step, acetyl-Coenzyme A (CoA) 

carboxylase (ACC) synthesizes malonyl-CoA from acetyl-CoA and bicarbonate. The enzymes which 
catalyze the remaining reactions are covalently linked into a single polypeptide chain, referred to as the 
multifunctional enzyme fatty acid synthase (FAS). FAS catalyzes the synthesis of palmitate from 
acetyl-CoA and malonyl-CoA. FAS contains acetyl transferase, malonyl transferase, p-ketoacetyl 

20 synthase, acyl carrier protein, p-ketoacyl reductase, dehydratase, enoyl reductase, and thioesterase 
activities. The final product of the FAS reaction is the 1 6-carbon fatty acid palmitate. Further 
elongation, as well as unsaturation, of palmitate by accessory enzymes of the ER produces the variety 
of long chain fatty acids required by the individual cell. These enzymes include a NADH-cytochrome 
b 5 reductase, cytochrome b 5 , and a desaturase. 

25 Phospholipid and Triacvlglvcerol Synthesis 

Triacylglycerols, also known as triglycerides and neutral fats, are major energy stores in 
animals. Triacylglycerols are esters of glycerol with three fatty acid chains. Glycerol-3-phosphate is 
produced from dihydroxyacetone phosphate by the enzyme glycerol phosphate dehydrogenase or from 
glycerol by glycerol kinase. Fatty acid-CoA's are produced from fatty acids by fatty acyl-CoA 

3 o synthetases. Glyercol-3-phosphate is acylated with two fatty acyl-CoA's by the enzyme glycerol 

phosphate acyltransferase to give phosphatidate. Phosphatidate phosphatase converts phosphatidate to 
diacylglycerol, which is subsequently acylated to a triacylglyercol by the enzyme diglyceride 
acyltransferase. Phosphatidate phosphatase and diglyceride acyltransferase form a triacylglyerol 
synthetase complex bound to the ER membrane. 
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A major class of phospholipids are the phosphoglycerides, which are composed of a glycerol 
backbone, two fatty acid chains, and a phosphorylated alcohol. Phosphoglycerides are components of 
cell membranes. Principal phosphoglycerides are phosphatidyl choline, phosphatidyl ethanolamine, 
phosphatidyl serine, phosphatidyl inositol, and diphosphatidyl glycerol. Many enzymes involved in 
5 phosphoglyceride synthesis are associated with membranes (Meyers, R.A. (1995) Molecular Biology 
and Biotechnology , VCH Publishers Inc., New York NY, pp. 494-501). Phosphatidate is converted to 
CDP-diacylglycerol by the enzyme phosphatidate cytidylyltransferase (ExPASy ENZYME EC 
2.7.7.41). Transfer of the diacylglycerol group from CDP-diacylglycerol to serine to yield phosphatidyl 
serine, or to inositol to yield phosphatidyl inositol, is catalyzed by the enzymes CDP-diacylglycerol- 

1 o serine O-phosphatidyltransferase and CDP-diacylglycerol-inositol 3-phosphatidyltransferase, 

respectively (ExPASy ENZYME EC 2.7.8.8; ExPASy ENZYME EC 2.7.8.1 1). The enzyme 
phosphatidyl serine decarboxylase catalyzes the conversion of phosphatidyl serine to phosphatidyl 
ethanolamine, using a pyruvate cofactor (Voelker, D.R. (1997) BiochinL Biophys. Acta 1348:236-244). 
Phosphatidyl choline is formed using diet-derived choline by the reaction of CDP-choline with 1 ,2- 
15 diacylglycerol, catalyzed by diacylglycerol cholinephosphotransferase (ExPASy ENZYME 2.7.8.2). 
Sterol, Steroid, and Isoprenoid Metabolism 

Cholesterol, composed of four fused hydrocarbon rings with an alcohol at one end, moderates 
the fluidity of membranes in which it is incorporated. In addition, cholesterol is used in the synthesis of 
steroid hormones such as Cortisol, progesterone, estrogen, and testosterone. Bile salts derived from 

2 o cholesterol facilitate the digestion of lipids. Cholesterol in the skin forms a barrier that prevents excess 

water evaporation from the body. Farnesyl and geranylgeranyl groups, which are derived from 
cholesterol biosynthesis intermediates, are post-translationally added to signal transduction proteins 
such as ras and protein-targeting proteins such as rab. These modifications are important for the 
activities of these proteins (Guyton, supra : Stryer, supra , pp. 279-280, 691-702, 934). 

2 5 Mammals obtain cholesterol derived from both de novo biosynthesis and the diet. The liver is 

the major site of cholesterol biosynthesis in mammals. Two acetyl-CoA molecules initially condense to 
form acetoacetyl-CoA, catalyzed by a thiolase. Acetoacetyl-CoA condenses with a third acetyl-CoA to 
form hydroxymethylglutaryl-CoA (HMG-CoA), catalyzed by HMG-CoA synthase. Conversion of 
HMG-CoA to cholesterol is accomplished via a series of enzymatic steps known as the mevalonate 

3 o pathway. The rate-limiting step is the conversion of HMG-CoA to mevalonate by HMG-CoA 

reductase. The drug lovastatin, a potent inhibitor of HMG-CoA reductase, is given to patients to 
reduce their serum cholesterol levels. Other mevalonate pathway enzymes include mevalonate kinase, 
phosphomevalonate kinase, diphosphomevalonate decarboxylase, isopentenyldiphosphate isomerase, 
dimethylallyl transferase, geranyl transferase, farnesyl-diphosphate farnesyltransferase, squalene 
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monooxygenase, lanosterol synthase, lathosterol oxidase, and 7-dehydrocholesterol reductase. 

Cholesterol is used in the synthesis of steroid hormones such as Cortisol, progesterone, 
aldosterone, estrogen, and testosterone. First, cholesterol is converted to pregnenolone by cholesterol 
monooxygenases. The other steroid hormones are synthesized from pregnenolone by a series of 
5 enzyme-catalyzed reactions including oxidations, isomerizations, hydroxylations, reductions, and 
demethylations. Examples of these enzymes include steroid A-isomerase, 3p-hydroxy-A 5 -steroid 
dehydrogenase, steroid 21 -monooxygenase, steroid 19-hydroxylase, and 3p-hydroxysteroid 
dehydrogenase. Cholesterol is also the precursor to vitamin D. 

Numerous compounds contain 5 -carbon isoprene units derived from the mevalonate pathway 
10 intermediate isopentenyl pyrophosphate. Isoprenoid groups are found in vitamin K, ubiquinone, retinal, 
dolichol phosphate (a carrier of oligosaccharides needed for N-linked glycosylation), and farnesyl and 
geranylgeranyl groups that modify proteins. Enzymes involved include farnesyl transferase, polyprenyl 
transferases, dolichyl phosphatase, and dolichyl kinase. 
Sphingolipid Metabolism 

15 Sphingolipids are an important class of membrane lipids that contain sphingosine, a long chain 

amino alcohol. They are composed of one long-chain fatty acid, one polar head alcohol, and 
sphingosine or sphingosine derivative. The three classes of sphingolipids are sphingomyelins, 
cerebrosides, and gangliosides. Sphingomyelins, which contain phosphocholine or 
phosphoethanolamine as their head group, are abundant in the myelin sheath surrounding nerve cells. 

2 o Galactocerebrosides, which contain a glucose or galactose head group, are characteristic of the brain. 
Other cerebrosides are found in nonneural tissues. Gangliosides, whose head groups contain multiple 
sugar units, are abundant in the brain, but are also found in nonneural tissues. 

Sphingolipids are built on a sphingosine backbone. Sphingosine is acylated to ceramide by the 
enzyme sphingosine acetyltransferase. Ceramide and phosphatidyl choline are converted to 

2 5 sphingomyelin by the enzyme ceramide choline phosphotransferase. Cerebrosides are synthesized by 

the linkage of glucose or galactose to ceramide by a transferase. Sequential addition of sugar residues 
to ceramide by transferase enzymes yields gangliosides. 
Eicosanoid Metabolism 

Eicosanoids, including prostaglandins, prostacyclin, thromboxanes, and leukotrienes, are 20- 

3 o carbon molecules derived from fatty acids. Eicosanoids are signaling molecules which have roles in 

pain, fever, and inflammation. The precursor of all eicosanoids is arachidonate, which is generated 
from phospholipids by phospholipase A2 and from diacylglycerols by diacylglycerol lipase. 
Leukotrienes are produced from arachidonate by the action of lipoxygenases. Prostaglandin synthase, 
reductases, and isomerases are responsible for the synthesis of the prostaglandins. Prostaglandins have 
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roles in inflammation, blood flow, ion transport, synaptic transmission, and sleep. Prostacyclin and the 
thromboxanes are derived from a precursor prostaglandin by the action of prostacyclin synthase and 
thromboxane synthases, respectively. 
Ketone Body Metabolism 

5 Pairs of acetyl-CoA molecules derived from fatty acid oxidation in the liver can condense to 

form acetoacetyl-CoA, which subsequently forms acetoacetate, D-3-hydroxybutyrate, and acetone. 
These three products are known as ketone bodies. Enzymes involved in ketone body metabolism 
include HMG-CoA synthetase, HMG-CoA cleavage enzyme, D-3-hydroxybutyrate dehydrogenase, 
acetoacetate decarboxylase, and 3-ketoacyl-CoA transferase. Ketone bodies are a normal fuel supply 

10 of the heart and renal cortex. Acetoacetate produced by the liver is transported to cells where the 
acetoacetate is converted back to acetyl-CoA and enters the citric acid cycle. In times of starvation, 
ketone bodies produced from stored triacylglyerols become an important fuel source, especially for the 
brain. Abnormally high levels of ketone bodies are observed in diabetics. Diabetic coma can result if 
ketone body levels become too great. 

15 Lipid Mobilization 

Within cells, fatty acids are transported by cytoplasmic fatty acid binding proteins (Online 
Mendelian Inheritance in Man (OMIM) *1 34650 Fatty Acid-Binding Protein 1, Liver; FABP1). 
Diazepam binding inhibitor (DBI), also known as endozepine and acyl CoA-binding protein, is an 
endogenous y-aminobutyric acid (GABA) receptor ligand which is thought to down-regulate the effects 

20 of GABA DBI binds medium- and long-chain acyl-CoA esters with very high affinity and may 

function as an intracellular carrier of acyl-CoA esters (OMIM *1 25950 Diazepam Binding Inhibitor; 
DBI; PROSITE PDOC00686 Acyl-CoA-binding protein signature). 

Fat stored in liver and adipose triglycerides may be released by hydrolysis and transported in 
the blood. Free fatty acids are transported in the blood by albumin. Triacylglycerols and cholesterol 

25 esters in the blood are transported in lipoprotein particles. The particles consist of a core of 

hydrophobic lipids surrounded by a shell of polar lipids and apolipoproteins. The protein components 
serve in the solubilization of hydrophobic lipids and also contain cell-targeting signals. Lipoproteins 
include chylomicrons, chylomicron remnants, very-low-density lipoproteins (VLDL), intermediate- 
density lipoproteins (IDL), low-density lipoproteins (LDL), and high-density lipoproteins (HDL). 

3 o There is a strong inverse correlation between the levels of plasma HDL and risk of premature coronary 
heart disease. 

Triacylglycerols in chylomicrons and VLDL are hydrolyzed by lipoprotein lipases that line 
blood vessels in muscle and other tissues that use fatty acids. Cell surface LDL receptors bind LDL 
particles which are then internalized by endocytosis. Absence of the LDL receptor, the cause of the 
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disease familial hypercholesterolemia, leads to increased plasma cholesterol levels and ultimately to 
atherosclerosis. Plasma cholesteryl ester transfer protein mediates the transfer of cholesteryl esters 
from HDL to apolipoprotein B-containing lipoproteins. Cholesteryl ester transfer protein is important 
in the reverse cholesterol transport system and may play a role in atherosclerosis (Yamashita, S. et al. 
5 (1997) Curr. Opin. Lipidol. 8:101-1 10). Macrophage scavenger receptors, which bind and internalize 
modified lipoproteins, play a role in lipid transport and may contribute to atherosclerosis (Greaves, 
D.R. et al. (1998) Curr. Opin. Lipidol. 9:425^32). 

Proteins involved in cholesterol uptake and biosynthesis are tightly regulated in response to 
cellular cholesterol levels. The sterol regulatory element binding protein (SREBP) is a sterol-responsive 

10 transcription factor. Under normal cholesterol conditions, SREBP resides in the ER membrane. When 
cholesterol levels are low, a regulated cleavage of SREBP occurs which releases the extracellular 
domain of the protein. This cleaved domain is then transported to the nucleus where it activates the 
transcription of the LDL receptor gene, and genes encoding enzymes of cholesterol synthesis, by 
binding the sterol regulatory element (SRE) upstream of the genes (Yang, J. et al. (1995) J. Biol. Chem. 

is 270:12152-12161). Regulation of cholesterol uptake and biosynthesis also occurs via the oxysterol- 
binding protein (OSBP). OSBP is a high-affinity intracellular receptor for a variety of oxysterols that 
down-regulate cholesterol synthesis and stimulate cholesterol esterification (Lagace, T.A. et al. (1997) 
Biochem. J. 326:205-213). 
Beta-oxidation 

2 o Mitochondrial and peroxisomal beta-oxidation enzymes degrade saturated and unsaturated fatty 

acids by sequential removal of two-carbon units from CoA-activated fatty acids. The main beta- 
oxidation pathway degrades both saturated and unsaturated fatty acids while the auxiliary pathway 
performs additional steps required for the degradation of unsaturated fatty acids. 

The pathways of mitochondrial and peroxisomal beta-oxidation use similar enzymes, but have 

25 different substrate specificities and functions. Mitochondria oxidize short-, medium-, and long-chain 
fatty acids to produce energy for cells. Mitochondrial beta-oxidation is a major energy source for 
cardiac and skeletal muscle. In liver, it provides ketone bodies to the peripheral circulation when 
glucose levels are low as in starvation, endurance exercise, and diabetes (Eaton, S. et al. (1996) 
Biochem. J. 320:345-357). Peroxisomes oxidize medium-, long-, and very-long-chain fatty acids, 

30 dicarboxylic fatty acids, branched fatty acids, prostaglandins, xenobiotics, and bile acid intermediates. 
The chief roles of peroxisomal beta-oxidation are to shorten toxic lipophilic carboxylic acids to 
facilitate their excretion and to shorten very-long-chain fatty acids prior to mitochondrial beta-oxidation 
(Mannaerts, G.P. and P.P. van Veldhoven (1993) Biochimie 75:147-158). 

Enzymes involved in beta-oxidation include acyl CoA synthetase, carnitine acyltransferase, 

32 



WO 01/21836 



PCT/US00/25643 



acyl CoA dehydrogenases, enoyl CoA hydratases, L-3-hydroxyacyl CoA dehydrogenase, p-ketothiolase, 
2,4-dienoyl CoA reductase, and isomer ase. 
Lipid Cleavage and Degradation 

Triglycerides are hydrolyzed to fatty acids and glycerol by lipases. Lysophospholipases 
5 (LPLs) are widely distributed enzymes that metabolize intracellular lipids, and occur in numerous 
isoforms. Small isoforms, approximately 15-30 kD, function as hydrolases; large isoforms, those 
exceeding 60 kD, function both as hydrolases and transacylases. A particular substrate for LPLs, 
lysophosphatidylcholine, causes lysis of cell membranes when it is formed or imported into a cell. 
LPLs are regulated by lipid factors including acylcarnitine, arachidonic acid, and phosphatidic acid. 

10 These lipid factors are signaling molecules important in numerous pathways, including the 

inflammatory response. (Anderson, R. et at. (1994) Toxicol. Appl. Pharmacol. 125:176-183; Selle, H. 
et al. (1993); Eur. J. Biochem. 212:411-416.) 

The secretory phospholipase A 2 (PLA2) superfamily comprises a number of heterogeneous 
enzymes whose common feature is to hydrolyze the sn-2 fatty acid acyl ester bond of 

15 phosphoglycerides. Hydrolysis of the glycerophospholipids releases free fatty acids and 

lysophospholipids. PLA2 activity generates precursors for the biosynthesis of biologically active lipids, 
hydroxy fatty acids, and platelet-activating factor. PLA2 hydrolysis of the sn-2 ester bond in 
phospholipids generates free fatty acids, such as arachidonic acid and lysophospholipids. 
Carbon and Carbohydrate Metabolism 

20 Carbohydrates, including sugars or saccharides, starch, and cellulose, are aldehyde or ketone 

compounds with multiple hydroxyl groups. The importance of carbohydrate metabolism is 
demonstrated by the sensitive regulatory system in place for maintenance of blood glucose levels. Two 
pancreatic hormones, insulin and glucagon, promote increased glucose uptake and storage by cells, and 
increased glucose release from cells, respectively. Carbohydrates have three important roles in 

25 mammalian cells. First, carbohydrates are used as energy stores, fuels, and metabolic intermediates. 
Carbohydrates are broken down to form energy in glycolysis and are stored as glycogen for late* use. 
Second, the sugars deoxyribose and ribose form part of the structural support of DNA and RNA, 
respectively. Third, carbohydrate modifications are added to secreted and membrane proteins and lipids 
as they traverse the secretory pathway. Cell surface carbohydrate-containing macromolecules, 

30 including glycoproteins, glycolipids, and transmembrane proteoglycans, mediate adhesion with other 
cells and with components of the extracellular matrix. The extracellular matrix is comprised of diverse 
glycoproteins, glycosaminoglycans (GAGs), and carbohydrate-binding proteins which are secreted from 
the cell and assembled into an organized meshwork in close association with the cell surface. The 
interaction of the cell with the surrounding matrix profoundly influences cell shape, strength, flexibility, 
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motility, and adhesion. These dynamic properties are intimately associated with signal transduction 
pathways controlling cell proliferation and differentiation, tissue construction, and embryonic 
development. 

Carbohydrate metabolism is altered in several disorders including diabetes mellitus, 
hyperglycemia, hypoglycemia, galactosemia, galactokinase deficiency, and UDP-galactose-4-epimerase 
deficiency (Fauci, A.S. et al. (1998) Harrison's Principles of Internal Medicine , McGraw-Hill, New 
York NY, pp. 2208-2209). Altered carbohydrate metabolism is associated with cancer. Reduced GAG 
and proteoglycan expression is associated with human lung carcinomas (Nackaerts, K. et al. (1997) Int. 
J. Cancer 74:335-345). The carbohydrate determinants sialyl Lewis A and sialyl Lewis X are 
frequently expressed on human cancer cells (Kannagi, R. (1997) Glycoconj. J. 14:577-584). 
Alterations of the N-linked carbohydrate core structure of cell surface glycoproteins are linked to colon 
and pancreatic cancers (Schwarz, R.E. et al. (1996) Cancer Lett. 107:285-291). Reduced expression 
of the Sda blood group carbohydrate structure in cell surface glycolipids and glycoproteins is observed 
in gastrointestinal cancer (Dohi, T. et al. (1996) Int. J. Cancer 67:626-663). (Carbon and 
carbohydrate metabolism is reviewed in Stryer, L. (1995) Biochemistry W.H. Freeman and Company, 
New York NY; Lehninger, A.L. (1982) Principles of Biochemistry Worth Publishers Inc., New York 
NY; and Lodish, H. et al. (1995) Molecular Cell Biology Scientific American Books, New York NY.) 
Glycolysis 

Enzymes of the glycolytic pathway convert the sugar glucose to pyruvate while simultaneously 
producing ATP. The pathway also provides building blocks for the synthesis of cellular components 
such as long-chain fatty acids. After glycolysis, pyrvuate is converted to acetyl-Coenzyme A, which, in 
aerobic organisms, enters the citric acid cycle. Glycolytic enzymes include hexokinase, phosphoglucose 
isomer ase, phosphofructokinase, aldolase, triose phosphate isomerase, glycer aldehyde 3-phosphate 
dehydrogenase, phosphoglycerate kinase, phosphoglyceromutase, enolase, and pyruvate kinase. Of 
these, phosphofructokinase, hexokinase, and pyruvate kinase are important in regulating the rate of 
glycolysis. 
Gluconeogenesis 

Gluconeogenesis is the synthesis of glucose from noncarbohydrate precursors such as lactate 
and amino acids. The pathway, which functions mainly in times of starvation and intense exercise, 
occurs mostly in the liver and kidney. Responsible enzymes include pyruvate carboxylase, 
phosphoenolpyruvate carboxykinase, fructose 1,6-bisphosphatase, and glucose-6-phosphatase. 
Pentose Phosphate Pathway 

Pentose phosphate pathway enzymes are responsible for generating the reducing agent 
NADPH, while at the same time oxidizing glucose-6-phosphate to ribose-5 -phosphate. Ribose-5- 
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phosphate and its derivatives become part of important biological molecules such as ATP, Coenzyme 
A, NAD + , FAD, RNA, and DNA. The pentose phosphate pathway has both oxidative and non- 
oxidative branches. The oxidative branch steps, which are catalyzed by the enzymes glucoses- 
phosphate dehydrogenase, lactonase, and 6-phosphogluconate dehydrogenase, convert glucose-6- 
5 phosphate and NADP + to ribulose-6-phosphate and NADPH. The non-oxidative branch steps, which 
are catalyzed by the enzymes phosphopentose isomerase, phosphopentose epimerase, transketolase, and 
transaldolase, allow the interconversion of three-, four-, five-, six-, and seven-carbon sugars. 
Glucouronate Metabolism 

Glucuronate is a monosaccharide which, in the form of D-glucuronic acid, is found in the 

10 GAGs chondroitin and dermatan. D-glucuronic acid is also important in the detoxification and 

excretion of foreign organic compounds such as phenol. Enzymes involved in glucuronate metabolism 
include UDP-glucose dehydrogenase and glucuronate reductase. 
Disaccharide Metabolism 

Disaccharides must be hydrolyzed to monosaccharides to be digested. Lactose, a disaccharide 

1 5 found in milk, is hydrolyzed to galactose and glucose by the enzyme lactase. Maltose is derived from 
plant starch and is hydrolyzed to glucose by the enzyme maltase. Sucrose is derived from plants and is 
hydrolyzed to glucose and fructose by the enzyme sucrase. Trehalose, a disaccharide found mainly in 
insects and mushrooms, is hydrolyzed to glucose by the enzyme trehalase (OMIM *275360 Trehalase; 
Ruf, J. et al. (1990) J. Biol. Chan. 265:15034-15039). Lactase, maltase, sucrase, and trehalase are 

20 bound to mucosal ceils lining the small intestine, where they participate in the digestion of dietary 

disaccharides. The enzyme lactose synthetase, composed of the catalytic subunit galactosyltransferase 
and the modifier subunit a-lactalbumin, converts UDP-galactose and glucose to lactose in the mammary 
glands. 

Glycogen, Starch, and Chitin Metabolism 

2 5 Glycogen is the storage form of carbohydrates in mammals. Mobilization of glycogen 

maintains glucose levels between meals and during muscular activity. Glycogen is stored mainly in the 
liver and in skeletal muscle in the form of cytoplasmic granules. These granules contain enzymes that 
catalyze the synthesis and degradation of glycogen, as well as enzymes that regulate these processes. 
Enzymes that catalyze the degradation of glycogen include glycogen phosphorylase, a transferase, a- 

30 1,6-glucosidase, and phosphoglucomutase. Enzymes that catalyze the synthesis of gjycogen include 
UDP-glucose pyrophosphorylase, glycogen synthetase, a branching enzyme, and nucleoside 
diphosphokinase. The enzymes of glycogen synthesis and degradation are tightly regulated by the 
hormones insulin, glucagon, and epinephrine. Starch, a plant-derived polysaccharide, is hydrolyzed to 
maltose, maltotriose, and a-dextrin by a-amylase, an enzyme secreted by the salivary glands and 
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pancreas. Chitin is a polysaccharide found in insects and Crustacea. A chitotriosidase is secreted by 
macrophages and may play a role in the degradation of chitin-containing pathogens (Boot, R.G. et al. 
(1995) J. Biol. Chem. 270:26252-26256). 
Peptidoglvcans and Glvcosaminoglvcans 
5 Glycosaminoglycans (GAGs) are anionic linear unbranched polysaccharides composed of 

repetitive disaccharide units. These repetitive units contain a derivative of an amino sugar, either 
glucosamine or galactosamine. GAGs exist free or as part of proteoglycans, large molecules composed 
of a core protein attached to one or more GAGs. GAGs are found on the cell surface, inside cells, and 
in the extracellular matrix. Changes in GAG levels are associated with several autoimmune diseases 

1 o including autoimmune thyroid disease, autoimmune diabetes mellitus, and systemic lupus 

erythematosus (Hansen, C. et al. (1996) Clin. Exp. Rheum. 14 (Suppl. 15):S59-S67). GAGs include 

chondroitin sulfate, keratan sulfate, heparin, heparan sulfate, dermatan sulfate, and hyaluronan. 

The GAG hyaluronan (HA) is found in the extracellular matrix of many cells, especially in soft 

connective tissues, and is abundant in synovial fluid (Pitsillides, A. A. et al. (1993) Int. J. Exp. Pathol. 
15 74:27-34). HA seems to play important roles in cell regulation, development, and differentiation 

(Laurent, T.C. and J.R. Fraser (1992) FASEB J. 6:2397-2404). Hyaluronidase is an enzyme that 

degrades HA to oligosaccharides. Hyaluronidases may function in cell adhesion, infection, 

angiogenesis, signal transduction, reproduction, cancer, and inflammation. 

Proteoglycans, also known as peptidoglycans, are found in the extracellular matrix of 
20 connective tissues such as cartilage and are essential for distributing the load in weight-bearing joints. 

Cell-surface-attached proteoglycans anchor cells to the extracellular matrix. Both extracellular and 

cell-surface proteoglycans bind growth factors, facilitating their binding to cell-surface receptors and 

subsequent triggering of signal transduction pathways. 

Amino Acid and Nitrogen Metabolism 

2 5 NH 4 + is assimilated into amino acids by the actions of two enzymes, glutamate 

dehydrogenase and glutamine synthetase. The carbon skeletons of amino acids come from the 
intermediates of glycolysis, the pentose phosphate pathway, or the citric acid cycle. Of the twenty 
amino acids used in proteins, humans can synthesize only thirteen (nonessential amino acids). The 
remaining nine must come from the diet (essential amino acids). Enzymes involved in nonessential 

3 o amino acid biosynthesis include glutamate kinase dehydrogenase, pyrroline carboxylate reductase, 

asparagine synthetase, phenylalanine oxygenase, methionine adenosyltransferase, 
adenosylhomocysteinase, cystathionine P-synthase, cystathionine y-lyase, phosphoglycerate 
dehydrogenase, phosphoserine transaminase, phosphoserine phosphatase, serine 
hydroxylmethyltransferase, and glycine synthase. 
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Metabolism of amino acids takes place almost entirely in the liver, where the amino group is 
removed by aminotransferases (transaminases), for example, alanine aminotransferase. The amino 
group is transferred to a-ketoglutarate to form glutamate. Glutamate dehydrogenase converts 
glutamate to NH 4 + and a-ketoglutarate. NH 4 + is converted to urea by the urea cycle which is 
5 catalyzed by the enzymes arginase, ornithine transcarbamoylase, arginosuccinate synthetase, and 
arginosuccinase. Carbamoyl phosphate synthetase is also involved in urea formation. Enzymes 
involved in the metabolism of the carbon skeleton of amino acids include serine dehydratase, 
asparaginase, glutaminase, propionyl CoA carboxylase, methylmalonyl CoA mutase, branched-chain 
a-keto dehydrogenase complex, isovaleryl CoA dehydrogenase, (J-methylcrotonyl CoA carboxylase, 
10 phenylalanine hydroxylase, p-hydroxylphenylpyruvate hydroxylase, and homogentisate oxidase. 

Polyamines, which include spermidine, putrescine, and spermine, bind tightly to nucleic acids 
and are abundant in rapidly proliferating cells. Enzymes involved in polyamine synthesis include 
ornithine decarboxylase. 

Diseases involved in amino acid and nitrogen metabolism include hyperammonemia, 
1 5 carbamoyl phosphate synthetase deficiency, urea cycle enzyme deficiencies, methylmalonic aciduria, 
maple syrup disease, alcaptonuria, and phenylketonuria. 
Energy Metabolism 

Cells derive energy from metabolism of ingested compounds that may be roughly categorized 
as carbohydrates, fats, or proteins. Energy is also stored in polymers such as triglycerides (fats) and 
20 glycogen (carbohydrates). Metabolism proceeds along separate reaction pathways connected by key 
intermediates such as acetyl coenzyme A (acetyl-CoA). Metabolic pathways feature anaerobic and 
aerobic degradation, coupled with the energy-requiring reactions such as phosphorylation of 
adenosine diphosphate (ADP) to the triphosphate (ATP) or analogous phosphorylations of guanosine 
(GDP/GTP), uridine (UDP/UTP), or cytidine (CDP/CTP). Subsequent dephosphorylation of the 

2 5 triphosphate drives reactions needed for cell maintenance, growth, and proliferation. 

Digestive enzymes convert carbohydrates and sugars to glucose; fructose and galactose are 
converted in the liver to glucose. Enzymes involved in these conversions include galactose- 1- 
phosphate uridyl transferase and UDP-galactose-4 epimerase. In the cytoplasm, glycolysis converts 
glucose to pyruvate in a series of reactions coupled to ATP synthesis. 

3 o Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 

the citric acid cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
3 5 dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 
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GTP. In oxidative phosphorylation, the transport of electrons from NADH and FADH 2 to oxygen by 
dehydrogenases is coupled to the synthesis of ATP from ADP and Pi by the FqF, ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the F 0 F 1 ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
5 reductase, cytochrome b, cytochrome c lt FeS protein, and cytochrome c oxidase. 

Triglycerides are hydrolyzed to fatty acids and glycerol by lipases. Glycerol is then 
phosphorylated to glycerol-3-phosphate by glycerol kinase and glycerol phosphate dehydrogenase, 
and degraded by the glycolysis. Fatty acids are transported into the mitochondria as fatty acyl- 
carnitine esters and undergo oxidative degradation. 

10 In addition to metabolic disorders such as diabetes and obesity, disorders of energy 

metabolism are associated with cancers (Dorward, A. et al. (1997) J. Bioenerg. Biomembr. 29:385- 
392), autism (Lombard, J. (1998) Med. Hypotheses 50:497-500), neurodegenerative disorders (Alexi, 
T. et al. (1998) Neuroreport 9:R57-64), and neuromuscular disorders (DiMauro, S. et al. (1998) 
Biochim. Biophys. Acta 1366:199-210). The myocardium is heavily dependent on oxidative 

15 metabolism, so metabolic dysfunction often leads to heart disease (DiMauro, S. and M. Hirano (1998) 
Curr. Opin. Cardiol. 13:190-197). 

For a review of energy metabolism enzymes and intermediates, see Stryer, L. et al. (1995) 
Biochemistry , W.H. Freeman and Co., San Francisco CA, pp. 443-652. For a review of energy 
metabolism regulation, see Lodish, H. et al. (1995) Molecular Cell Biology . Scientific American 

20 Books, New York NY, pp. 744-770. 
Cofactor Metabolism 

Cofactors, including coenzymes and prosthetic groups, are small molecular weight inorganic 
or organic compounds that are required for the action of an enzyme. Many cofactors contain vitamins 
as a component. Cofactors include thiamine pyrophosphate, flavin adenine dinucleotide, flavin 

25 mononucleotide, nicotinamide adenine dinucleotide, pyridoxal phosphate, coenzyme A, 

tetrahydrofolate, lipoamide, and heme. The vitamins biotin and cobalamin are associated with 
enzymes as well. Heme, a prosthetic group found in myoglobin and hemoglobin, consists of 
protoporphyrin group bound to iron. Porphyrin groups contain four substituted pyrroles covalently 
joined in a ring, often with a bound metal atom. Enzymes involved in porphyrin synthesis include 6- 

30 aminolevulinate synthase, S-aminolevulinate dehydrase, porphobilinogen deaminase, and cosynthase. 
Deficiencies in heme formation cause porphyrias. Heme is broken down as a part of erythrocyte 
turnover. Enzymes involved in heme degradation include heme oxygenase and biliverdin reductase. 

Iron is a required cofactor for many enzymes. Besides the heme-containing enzymes, iron is 
found in iron-sulfur clusters in proteins including aconitase, succinate dehydrogenase, and NADH-Q 

3 5 reductase. Iron is transported in the blood by the protein transferrin. Binding of transferrin to the 
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transferrin receptor on cell surfaces allows uptake by receptor mediated endocytosis. Cytosolic iron is 
bound to ferritin protein. 

A molybdenum-containing cofactor (molybdopterin) is found in enzymes including sulfite 
oxidase, xanthine dehydrogenase, and aldehyde oxidase. Molybdopterin biosynthesis is performed by 
5 two molybdenum cofactor synthesizing enzymes. Deficiencies in these enzymes cause mental 
retardation and lens dislocation. Other diseases caused by defects in cofactor metabolism include 
pernicious anemia and methylmalonic aciduria. 
Secretion and Trafficking 

Eukaryotic cells are bound by a lipid bilayer membrane and subdivided into functionally 

10 distinct, membrane bound compartments. The membranes maintain the essential differences between 
the cytosol, the extracellular environment, and the lumenal space of each intracellular organelle. As 
lipid membranes are highly impermeable to most polar molecules, transport of essential nutrients, 
metabolic waste products, cell signaling molecules, macromolecules and proteins across lipid 
membranes and between organelles must be mediated by a variety of transport-associated molecules. 

15 Protein Trafficking 

In eukaryotes, some proteins are synthesized on ER-bound ribosomes, co-translationally 
imported into the ER, delivered from the ER to the Golgi complex for post-translational processing and 
sorting, and transported from the Golgi to specific intracellular and extracellular destinations. All cells 
possess a constitutive transport process which maintains homeostasis between the cell and its 

2 o environment. In many differentiated cell types, the basic machinery is modified to carry out specific 
transport functions. For example, in endocrine glands, hormones and other secreted proteins are 
packaged into secretory granules for regulated exocytosis to the cell exterior. In macrophage, foreign 
extracellular material is engulfed (phagocytosis) and delivered to lysosomes for degradation. In fat and 
muscle cells, glucose transporters are stored in vesicles which fuse with the plasma membrane only in 

2 5 response to insulin stimulatioa 

The Secretory Pathway 

Synthesis of most integral membrane proteins, secreted proteins, and proteins destined for the 
lumen of a particular organelle occurs on ER-bound ribosomes. These proteins are co-translationally 
imported into the ER. The proteins leave the ER via membrane-bound vesicles which bud off the ER at 

3 o specific sites and fuse with each other (homotypic fusion) to form the ER-Golgi Intermediate 

Compartment (ERGIC). The ERGIC matures progressively through the cis, medial, and trans cisternal 
stacks of the Golgi, modifying the enzyme composition by retrograde transport of specific Golgi 
enzymes. In this way, proteins moving through the Golgi undergo post-translational modification, such 
as glycosylation. The final Golgi compartment is the Trans-Golgi Network (TGN), where both 
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membrane and lumenal proteins are sorted for their final destination. Transport vesicles destined for 
intracellular compartments, such as the lysosome, bud off the TGN. What remains is a secretory 
vesicle which contains proteins destined for the plasma membrane, such as receptors, adhesion 
molecules, and ion channels, and secretory proteins, such as hormones, neurotransmitters, and digestive 
5 enzymes. Secretory vesicles eventually fuse with the plasma membrane (Glick, B.S. and V. Malhotra 
(1998) Cell 95:883-889). 

The secretory process can be constitutive or regulated. Most cells have a constitutive pathway 
for secretion, whereby vesicles derived from maturation of the TGN require no specific signal to fuse 
with the plasma membrane. In many cells, such as endocrine cells, digestive cells, and neurons, vesicle 
l o pools derived from the TGN collect in the cytoplasm and do not fuse with the plasma membrane until 
they are directed to by a specific signal. . 
Endocvtosis 

Endocytosis, wherein cells internalize material from the extracellular environment, is essential 
for transmission of neuronal, metabolic, and proliferative signals; uptake of many essential nutrients; 

is and defense against invading organisms. Most cells exhibit two forms of endocytosis. The first, 
phagocytosis, is an actin-driven process exemplified in macrophage and neutrophils. Material to be 
endocytosed contacts numerous cell surface receptors which stimulate the plasma membrane to extend 
and surround the particle, enclosing it in a membrane-bound phagosome. In the mammalian immune 
system, IgG-coated particles bind Fc receptors on the surface of phagocytic leukocytes. Activation of 

20 the Fc receptors initiates a signal cascade involving src-family cytosolic kinases and the monomeric 

GTP-binding (G) protein Rho. The resulting actin reorganization leads to phagocytosis of the particle. 
This process is an important component of the humoral immune response, allowing the processing and 
presentation of bacterial-derived peptides to antigen-specific T-lymphocytes. 

The second form of endocytosis, pinocytosis, is a more generalized uptake of material from the 

25 external milieu. Like phagocytosis, pinocytosis is activated by ligand binding to cell surface receptors. 
Activation of individual receptors stimulates an internal response that includes coalescence of the 
receptor-ligand complexes and formation of clathrin-coated pits. Invagination of the plasma membrane 
at clathrin-coated pits produces an endocytic vesicle within the cell cytoplasm. These vesicles undergo 
homotypic fusion to form an early endosomal (EE) compartment. The tubulovesicular EE serves as a 

3 o sorting site for incoming material. ATP-driven proton pumps in the EE membrane lowers the pH of the 
EE lumen (pH 6.3-6.8). The acidic environment causes many ligands to dissociate from their receptors. 
The receptors, along with membrane and other integral membrane proteins, are recycled back to the 
plasma membrane by budding off the tubular extensions of the EE in recycling vesicles (RV). This 
selective removal of recycled components produces a carrier vesicle containing ligand and other 
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material from the external environment. The carrier vesicle fuses with TGN-derived vesicles which 
contain hydrolytic enzymes. The acidic environment of the resulting late endosome (LE) activates the 
hydrolytic enzymes which degrade the ligands and other material. As digestion takes place, the LE 
fuses with the lysosome where digestion is completed (Mellman, I, (1996) Annu. Rev. Cell Dev. Biol. 
5 12:575-625). 

Recycling vesicles may return directly to the plasma membrane. Receptors internalized and 
returned directly to the plasma membrane have a turnover rate of 2-3 minutes. Some RVs undergo 
microtubule-directed relocation to a perinuclear site, from which they then return to the plasma 
membrane. Receptors following this route have a turnover rate of 5-10 minutes. Still other RVs are 
10 retained within the cell until an appropriate signal is received (Mellman, supra : and James, D.E. et al. 
(1994) Trends Cell Biol. 4:120-126). 
Vesicle Formation 

Several steps in the transit of material along the secretory and endocytic pathways require the 
formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic reticulum 

1 5 (tER), the rim of Golgi cisternae, the face of the Trans-Golgi Network (TGN), the plasma membrane 
(PM), and tubular extensions of the endosomes. The process begins with the budding of a vesicle out of 
the donor membrane. The membrane-bound vesicle contains proteins to be transported and is 
surrounded by a protective coat made up of protein subunits recruited from the cytosol. The initial 
budding and coating processes are controlled by a cytosolic ras-like GTP-binding protein, ADP- 

20 ribosylating factor (Arf), and adaptor proteins (AP). Different isoforms of both Arf and AP are 

involved at different sites of budding. Another small G-protein, dynamin, forms a ring complex around 
the neck of the forming vesicle and may provide the mechanochemical force to accomplish the final step 
of the budding process. The coated vesicle complex is then transported through the cytosol. During the 
transport process, Arf-bound GTP is hydrolyzed to GDP and the coat dissociates from the transport 

25 vesicle (West, M.A. et al. (1997) J. Cell Biol. 138:1239-1254). Two different classes of coat protein 
have also been identified. Clathrin coats form on the TGN and PM surfaces, whereas coatomer or COP 
coats form on the ER and Golgi. COP coats can further be distinguished as COPI, involved in 
retrograde traffic through the Golgi and from the Golgi to the ER, and COPII, involved in anterograde 
traffic from the ER to the Golgi (Mellman, supra) . The COP coat consists of two major components, a 

30 G-protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar complex of seven 
proteins, termed alpha-, beta-, beta'-, gamma-, delta-, epsilon- and zeta-COP. (Harter, C. and F.T. 
Wieland(1998)Proc. Natl. Acad. Sci. USA 95:1 1649-1 1654.) 
Membrane Fusion 

Transport vesicles undergo homotypic or heterotypic fusion in the secretory and endocytotic 
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pathways. Molecules required for appropriate targeting and fusion of vesicles with their target 
membrane include proteins incorporated in the vesicle membrane, the target membrane, and proteins 
recruited from the cytosol. During budding of the vesicle from the donor compartment, an integral 
membrane protein, VAMP (vesicle-associated membrane protein) is incorporated into the vesicle. Soon 
5 after the vesicle uncoats, a cytosolic prenylated GTP-binding protein, Rab (a member of the Ras 

superfamily), is inserted into the vesicle membrane. GTP-bound Rab proteins are directed into nascent 
transport vesicles where they interact with VAMP. Following vesicle transport, GTPase activating 
proteins (GAPs) in the target membrane convert Rab proteins to the GDP-bound form. A cytosolic 
protein, guanine-nucleotide dissociation inhibitor (GDI) helps return GDP-bound Rab proteins to their 

10 membrane of origin. Several Rab isoforms have been identified and appear to associate with specific 
compartments within the cell. Rab proteins appear to play a role in mediating the function of a viral 
gene, Rev, which is essential for replication of HIV-1, the virus responsible for AIDS (Flavell, R.A. et 
al. (1996) Proc. Natl. Acad. Sci. USA 93:4421-4424). 

Docking of the transport vesicle with the target membrane involves the formation of a complex 

is between the vesicle SNAP receptor (v-SNARE), target membrane (t-) SNAREs, and certain other 
membrane and cytosolic proteins. Many of these other proteins have been identified although their 
exact functions in the docking complex remain uncertain (Tellam, J.T. et al. (1995) J. Biol. Chem. 
270:5857-63; and Hata, Y. andT.C. Sudhof (1995) J. Biol. Chem. 270:13022-28). N-ethylmaleimide 
sensitive factor (NSF) and soluble NSF-attachment protein (ct-SNAP and p-SNAP) are two such 

2 o proteins that are conserved from yeast to man and function in most intracellular membrane fusion 
reactions. Seel represents a family of yeast proteins that function at many different stages in the 
secretory pathway including membrane fusion. Recently, mammalian homologs of Seel, called 
Munc-18 proteins, have been identified (Katagiri, H. et al. (1995) J. Biol. Chem. 270:4963-4966; Hata 
et al. supra) . 

2 5 The SNARE complex involves three SNARE molecules, one in the vesicular membrane and 

two in the target membrane. Synaptotagmin is an integral membrane protein in the synaptic vesicle 
which associates with the t-SNARE syntaxin in the docking complex. Synaptotagmin binds calcium in 
a complex with negatively charged phospholipids, which allows the cytosolic SNAP protein to displace 
synaptotagmin from syntaxin and fusion to occur. Thus, synaptotagmin is a negative regulator of 

3 o fusion in the neuron (Littleton, J.T. et al. ( 1 993) Cell 74: 1 1 25- 1 1 34). The most abundant membrane 

protein of synaptic vesicles appears to be the glycoprotein synaptophysin, a 38 kDa protein with four 
transmembrane domains. 

Specificity between a vesicle and its target is derived from the v-SNARE, t-SNAREs, and 
associated proteins involved. Different isoforms of SNAREs and Rabs show distinct cellular and 
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subcellular distributions. VAMP-l/synaptobrevin, membrane-anchored synaptosome-associated 
protein of 25 kDa (SNAP-25), syntaxin-1, Rab3A, Rabl5 f and Rab23 are predominantly expressed in 
the brain and nervous system. Different syntaxin, VAMP, and Rab proteins are associated with 
distinct subcellular compartments and their vesicular carriers. 
5 Nuclear Transport 

Transport of proteins and RNA between the nucleus and the cytoplasm occurs through nuclear 
pore complexes (NPCs). NPC-mediated transport occurs in both directions through the nuclear 
envelope. All nuclear proteins are imported from the cytoplasm, their site of synthesis. tRNA and 
mRNA are exported from the nucleus, their site of synthesis, to the cytoplasm, their site of function. 

10 Processing of small nuclear RNAs involves export into the cytoplasm, assembly with proteins and 
modifications such as hypermethylation to produce small nuclear ribonuclear proteins (snRNPs), and 
subsequent import of the snRNPs back into the nucleus. The assembly of ribosomes requires the initial 
import of ribosomal proteins from the cytoplasm, their incorporation with RNA into ribosomal 
subunits, and export back to the cytoplasm. (Gorlich, D. and I.W. Mattaj (1996) Science 271:1513- 

15 1518.) 

The transport of proteins and mRNAs across the NPC is selective, dependent on nuclear 
localization signals, and generally requires association with nuclear transport factors. Nuclear 
localization signals (NLS) consist of short stretches of amino acids enriched in basic residues. NLS are 
found on proteins that are targeted to the nucleus, such as the glucocorticoid receptor. The NLS is 

2 o recognized by the NLS receptor, importin, which then interacts with the monomeric GTP-binding 

protein Ran. This NLS protein/receptor/Ran complex navigates the nuclear pore with the help of the 
homodimeric protein nuclear transport factor 2 (NTF2). NTF2 binds the GDP-bound form of Ran and 
to multiple proteins of the nuclear pore complex containing FXFG repeat motifs, such as p62. 
(Paschal, B. et al. (1997) J. Biol. Chem. 272:21534-21539; and Wong, D.H. et at (1997) Mol. Cell 
25 Biol. 17:3755-3767). Some proteins are dissociated before nuclear mRNAs are transported across the 
NPC while others are dissociated shortly after nuclear mRNA transport across the NPC and are 
reimported into the nucleus. 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 

3 o transport or secretion of proteins. For example, abnormal hormonal secretion is linked to disorders 

such as diabetes insipidus (vasopressin), hypo:- and hypoglycemia (insulin, glucagon), Grave's disease 
and goiter (thyroid hormone), and Cushing's and Addison's diseases (adrenocorticotropic hormone, 
ACTH). Moreover, cancer cells secrete excessive amounts of hormones or other biologically active 
peptides. Disorders related to excessive secretion of biologically active peptides by tumor cells include 
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fasting hypoglycemia due to increased insulin secretion from insulinoma-islet cell tumors; hypotension 
due to increased epinephrine and norepinephrine secreted from pheochromocytomas of the adrenal 
medulla and sympathetic paraganglia; and carcinoid syndrome, which is characterized by abdominal 
cramps, diarrhea, and valvular heart disease caused by excessive amounts of vasoactive substances 
5 such as serotonin, bradykinin, histamine, prostaglandins, and polypeptide hormones, secreted from 
intestinal tumors. Biologically active peptides that are ectopically synthesized in and secreted from 
tumor cells include ACTH and vasopressin (lung and pancreatic cancers); parathyroid hormone (lung 
and bladder cancers); calcitonin (lung and breast cancers); and thyroid-stimulating hormone (medullary 
thyroid carcinoma). Such peptides may be useful as diagnostic markers for tumorigenesis (Schwartz, 
10 M.Z. (1997) Semin. Pediatr. Surg. 3:141-146; and Said, S.I. and G.R. Faloona (1975) N. Engl. J. Med. 
293:155-160). 

Defective nuclear transport may play a role in cancer. The BRCA1 protein contains three 
potential NLSs which interact with importin alpha, and is transported into the nucleus by the 
importin/NPC pathway. In breast cancer cells the BRCA1 protein is aberrantly localized in the 
1 5 cytoplasm. The mislocation of the BRCA1 protein in breast cancer cells may be due to a defect in the 
NPC nuclear import pathway (Chen, C.F. et ad. (1996) J. Biol. Chem. 271:32863-32868). 

It has been suggested that in some breast cancers, the tumor-suppressing activity of p53 is 
inactivated by the sequestration of the protein in the cytoplasm, away from its site of action in the cell 
nucleus. Cytoplasmic wild-type p53 was also found in human cervical carcinoma cell lines. (Moll, 
20 U.M. et al. (1992) Proc. Natl. Acad. Sci. USA 89:7262-7266; and Liang, X.H. et aL (1993) Oncogene 
8:2645-2652.) 
Environmental Responses 

Organisms respond to the environment by a number of pathways. Heat shock proteins, 
including hsp 70, hsp60, hsp90, and hsp 40, assist organisms in coping with heat damage to cellular 
25 proteins. 

Aquaporins (AQP) are channels that transport water and, in some cases, nonionic small solutes 
such as urea and glycerol. Water movement is important for a number of physiological processes 
including renal fluid filtration, aqueous humor generation in the eye, cerebrospinal fluid production in 
the brain, and appropriate hydration of the lung. Aquaporins are members of the major intrinsic protein 
30 (MIP) family of membrane transporters (King, L.S. andP, Agre (1996) Annu. Rev. Physiol. 58:619- 
648; Ishibashi, K. et al. (1997) J. Biol. Chem. 272:20782-20786). The study of aquaporins may have 
relevance to understanding edema formation and fluid balance in both normal physiology and disease 
states (King, supra) . Mutations in AQP2 cause autosomal recessive nephrogenic diabetes insipidus 
(OMIM *107777 Aquaporin 2; AQP2). Reduced AQP4 expression in skeletal muscle may be 
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associated with Duchenne muscular dystrophy (Frigeri, A. et al. (1998) J. Clin. Invest. 102:695-703). 

Mutations in AQPO cause autosomal dominant cataracts in the mouse (OMIM *1 54050 Major Intrinsic 

Protein of Lens Fiber; MIP). 

The metallothioneins (MTs) are a group of small (61 amino acids) , cysteine-rich proteins that 
5 bind heavy metals such as cadmium, zinc, mercury, lead, and copper and are thought to play a role in 

metal detoxification or the metabolism and homeostasis of metals. Arsenite-resistance proteins have 

been identified in hamsters that are resistant to toxic levels of arsenite (Rossman, T.G. et al. (1997) 

Mutat. Res. 386:307-314). 

Humans respond to light and odors by specific protein pathways. Proteins involved in light 
10 perception include rhodopsin, transducin, and cGMP phosphodiesterase. Proteins involved in odor 

perception include multiple olfactory receptors. Other proteins are important in human Circadian 

rhythms and responses to wounds. 

Immunity and Host Defense 

All vertebrates have developed sophisticated and complex immune systems that provide 
15 protection from viral, bacterial, fungal and parasitic infections. Included in these systems are the 

processes of humoral immunity, the complement cascade and the inflammatory response (Paul, W.E. 

(1993) Fundamental Immunology , Raven Press, Ltd., New York NY, pp. 1-20). 

The cellular components of the humoral immune system include six different types of 

leukocytes: monocytes, lymphocytes, polymorphonuclear granulocytes (consisting of neutrophils, 
20 eosinophils, and basophils) and plasma cells. Additionally, fragments of megakaryocytes, a seventh 

type of white blood cell in the bone marrow, occur in large numbers in the blood as platelets. 

Leukocytes are formed from two stem cell lineages in bone marrow. The myeloid stem cell 

line produces granulocytes and monocytes and, the lymphoid stem cell produces lymphocytes. 

Lymphoid cells travel to the thymus, spleen and lymph nodes, where they mature and differentiate 
25 into lymphocytes. Leukocytes are responsible for defending the body against invading pathogens. 

Neutrophils and monocytes attack invading bacteria, viruses, and other pathogens and destroy them 

by phagocytosis. Monocytes enter tissues and differentiate into macrophages which are extremely 

phagocytic. Lymphocytes and plasma cells are a part of the immune system which recognizes 

specific foreign molecules and organisms and inactivates them, as well as signals other cells to attack 
30 the invaders. 

Granulocytes and monocytes are formed and stored in the bone marrow until needed. 
Megakaryocytes are produced in bone marrow, where they fragment into platelets and are released 
into the bloodstream. The main function of platelets is to activate the blood clotting mechanism. 
Lymphocytes and plasma cells are produced in various lymphogenous organs, including the lymph 
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nodes, spleen, thymus, and tonsils. 

Both neutrophils and macrophages exhibit chemotaxis towards sites of inflammation. Tissue 
inflammation in response to pathogen invasion results in production of chemo-attractants for 
leukocytes, such as endotoxins or other bacterial products, prostaglandins, and products of leukocytes 
5 or platelets. 

Basophils participate in the release of the chemicals involved in the inflammatory process. 
The main function of basophils is secretion of these chemicals to such a degree that they have been 
referred to as "unicellular endocrine glands". A distinct aspect of basophilic secretion is that the 
contents of granules go directly into the extracellular environment, not into vacuoles as occurs with 

10 neutrophils, eosinophils and monocytes. Basophils have receptors for the Fc fragment of 

immunoglobulin E (IgE) that are not present on other leukocytes. Crosslinking of membrane IgE 
with anti-IgE or other ligands triggers degranulation. 

Eosinophils are bi- or multi-nucleated white blood cells which contain eosinophilic granules. 
Their plasma membrane is characterized by Ig receptors, particularly IgG and IgE. Generally, 

15 eosinophils are stored in the bone marrow until recruited for use at a site of inflammation or invasion. 
They have specific functions in parasitic infections and allergic reactions, and are thought to detoxify 
some of the substances released by mast cells and basophils which cause inflammation. Additionally, 
they phagocytize antigen-antibody complexes and further help prevent spread of the inflammation. 
Macrophages are monocytes that have left the blood stream to settle in tissue. Once 

20 monocytes have migrated into tissues, they do not re-enter the bloodstream. The mononuclear 

phagocyte system is comprised of precursor cells in the bone marrow, monocytes in circulation, and 
macrophages in tissues. The system is capable of very fast and extensive phagocytosis. A 
macrophage may phagocytize over 100 bacteria, digest them and extrude residues, and then survive 
for many more months. Macrophages are also capable of ingesting large particles, including red 

2 5 blood cells and malarial parasites. They increase several-fold in size and transform into macrophages 

that are characteristic of the tissue they have entered, surviving in tissues for several months. 

Mononuclear phagocytes are essential in defending the body against invasion by foreign 
pathogens, particularly intracellular microorganisms such as M. tuberculosis , listeria, leishmania and 
toxoplasma. Macrophages can also control the growth of tumorous cells, via both phagocytosis and 

3 o secretion of hydrolytic enzymes. Another important function of macrophages is that of processing 

antigen and presenting them in a biochemically modified form to lymphocytes. 

The immune system responds to invading microorganisms in two major ways: antibody 
production and cell mediated responses. Antibodies are immunoglobulin proteins produced by 
B-lymphocytes which bind to specific antigens and cause inactivation or promote destruction of the 
3 5 antigen by other cells. Cell-mediated immune responses involve T-lymphocytes (T cells) that react 
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with foreign antigen on the surface of infected host cells. Depending on the type of T cell, the 
infected cell is either killed or signals are secreted which activate macrophages and other cells to 
destroy the infected cell (Paul, supra) . 

T-lymphocytes originate in the bone marrow or liver in fetuses. Precursor cells migrate via 
5 the blood to the thymus, where they are processed to mature into T-lymphocytes. This processing is 
crucial because of positive and negative selection of T cells that will react with foreign antigen and 
not with self molecules. After processing, T cells continuously circulate in the blood and secondary 
lymphoid tissues, such as lymph nodes, spleen, certain epithelium-associated tissues in the 
gastrointestinal tract, respiratory tract and skin. When T-lymphocytes are presented with the 

1 o complementary antigen, they are stimulated to proliferate and release large numbers of activated T 

cells into the lymph system and the blood system. These activated T cells can survive and circulate 
for several days. At the same time, T memory cells are created, which remain in the lymphoid tissue 
for months or years. Upon subsequent exposure to that specific antigen, these memory cells will 
respond more rapidly and with a stronger response than induced by the original antigen. This creates 
is an "immunological memory" that can provide immunity for years. 

There are two major types of T cells: cytotoxic T cells destroy infected host cells, and helper 
T cells activate other white blood cells via chemical signals. One class of helper cell, T H 1 , activates 
macrophages to destroy ingested microorganisms, while another, T H 2, stimulates the production of 
antibodies by B cells. 

20 Cytotoxic T cells directly attack the infected target cell. In virus-infected cells, peptides 

derived from viral proteins are generated by the proteasome. These peptides are transported into the 
ER by the transporter associated with antigen processing (TAP) (Pamer, E. and P. Cresswell (1998) 
Annu. Rev. Immunol. 16:323-358). Once inside the ER, the peptides bind MHC I chains, and the 
peptide/MHC I complex is transported to the cell surface. Receptors on the surface of T cells bind to 

2 5 antigen presented on cell surface MHC molecules. Once activated by binding to antigen, T cells 

secrete Y-interferon, a signal molecule that induces the expression of genes necessary for presenting 
viral (or other) antigens to cytotoxic T cells. Cytotoxic T cells kill the infected cell by stimulating 
programmed cell death. 

Helper T cells constitute up to 75% of the total T cell population. They regulate the immune 

3 o functions by producing a variety of lymphokines that act on other cells in the immune system and on 

bone marrow. Among these lymphokines are: interleukins-2,3,4,5,6; granulocyte-monocyte colony 
stimulating factor, and y-interferon. 

Helper T cells are required for most B cells to respond to antigen. When an activated helper 
cell contacts a B cell, its centrosome and Golgi apparatus become oriented toward the B cell, aiding 
3 5 the directing of signal molecules, such as transmembrane-bound protein called CD40 ligand, onto the 
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B cell surface to interact with the CD40 transmembrane protein. Secreted signals also help B cells to 
proliferate and mature and, in some cases, to switch the class of antibody being produced. 

B-lymphocytes (B cells) produce antibodies which react with specific antigenic proteins 
presented by pathogens. Once activated, B cells become filled with extensive rough endoplasmic 
5 reticulum and are known as plasma cells. As with T cells, interaction of B cells with antigen 

stimulates proliferation of only those B cells which produce antibody specific to that antigen. There 
are five classes of antibodies, known as immunoglobulins, which together comprise about 20% of 
total plasma protein. Each class mediates a characteristic biological response after antigen binding. 
Upon activation by specific antigen B cells switch from making membrane-bound antibody to 

1 0 secretion of that antibody. 

Antibodies, or immunoglobulins (Ig), are the founding members of the Ig superfamily and the 
central components of the humoral immune response. Antibodies are either expressed on the surface 
of B cells or secreted by B cells into the circulation. Antibodies bind and neutralize blood-borne 
foreign antigens. The prototypical antibody is a tetramer consisting of two identical heavy 

15 polypeptide chains (H-chains) and two identical light polypeptide chains (L-chains) interlinked by 
disulfide bonds. This arrangement confers the characteristic Y-shape to antibody molecules. 
Antibodies are classified based on their H-chain composition. The five antibody classes, IgA, IgD, 
IgE, IgG and IgM, are defined by the a, 6, e, y 9 and H-chain types. There are two types of L- 
chains, k and A, either of which may associate as a pair with any H-chain pair. IgG, the most 

20 common class of antibody found in the circulation, is tetrameric, while the other classes of antibodies 
are generally variants or multimers of this basic structure. 

H-chains and L-chains each contain an N-terminal variable region and a C-terminal constant 
region. Both H-chains and L-chains contain repeated Ig domains. For example, a typical H-chain 
contains four Ig domains, three of which occur within the constant region and one of which occurs 

25 within the variable region and contributes to the formation of the antigen recognition site. Likewise, 
a typical L-chain contains two Ig domains, one of which occurs within the constant region and one of 
which occurs within the variable region. In addition, H chains such as \i have been shown to 
associate with other polypeptides during differentiation of the B cell. 

Antibodies can be described in terms of their two main functional domains. Antigen 

3 o recognition is mediated by the Fab (antigen binding fragment) region of the antibody, while effector 
functions are mediated by the Fc (crystallizable fragment) region. Binding of antibody to an antigen, 
such as a bacterium, triggers the destruction of the antigen by phagocytic white blood cells such as 
macrophages and neutrophils. These cells express surface receptors that specifically bind to the 
antibody Fc region and allow the phagocytic cells to engulf, ingest, and degrade the antibody-bound 

3 5 antigen. The Fc receptors expressed by phagocytic cells are single-pass transmembrane glycoproteins 
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of about 300 to 400 amino acids (Sears, D.W. et al. (1990) J. Immunol. 144:371-378). The 
extracellular portion of the Fc receptor typically contains two or three Ig domains. 

Diseases which cause over- or under-abundance of any one type of leukocyte usually result in 
the entire immune defense system becoming involved. A well-known autoimmune disease is AIDS 
5 (Acquired Immunodeficiency Syndrome) where the number of helper T cells is depleted, leaving the 
patient susceptible to infection by microorganisms and parasites. Another widespread medical 
condition attributable to the immune system is that of allergic reactions to certain antigens. Allergic 
reactions include: hay fever, asthma, anaphylaxis, and urticaria (hives). Leukemias are an excess 
production of white blood cells, to the point where a major portion of the body's metabolic resources 

10 are directed solely at proliferation of white blood cells, leaving other tissues to starve. Leukopenia or 
agranulocytosis occurs when the bone marrow stops producing white blood cells. This leaves the 
body unprotected against foreign microorganisms, including those which normally inhabit skin, 
mucous membranes, and gastrointestinal tract. If all white blood cell production stops completely, 
infection will occur within two days and death may follow only 1 to 4 days later. 

is Impaired phagocytosis occurs in several diseases, including monocytic leukemia, systemic 

lupus, and granulomatous disease. In such a situation, macrophages can phagocytize normally, but 
the enveloped organism is not killed. A defect in the plasma membrane enzyme which converts 
oxygen to lethally reactive forms results in abscess formation in liver, lungs, spleen, lymph nodes, 
and beneath the skin. Eosinophilia is an excess of eosinophils commonly observed in patients with 

20 allergies (hay fever, asthma), allergic reactions to drugs, rheumatoid arthritis, and cancers (Hodgkin's 
disease, lung, and liver cancer) (Isselbacher, KJ. et al. (1994) Harrison's Principles of Internal 
Medicine , McGraw-Hill, Inc., New York NY). 

Host defense is further augmented by the complement system. The complement system 
serves as an effector system and is involved in infectious agent recognition. It can function as an 

2 5 independent immune network or in conjunction with other humoral immune responses. The 

complement system is comprised of numerous plasma and membrane proteins that act in a cascade of 
reaction sequences whereby one component activates the next. The result is a rapid and amplified 
response to infection through either an inflammatory response or increased phagocytosis. 

The complement system has more than 30 protein components which can be divided into 

3 o functional groupings including modified serine proteases, membrane-binding proteins and regulators 

of complement activation. Activation occurs through two different pathways the classical and the 
alternative. Both pathways serve to destroy infectious agents through distinct triggering mechanisms 
that eventually merge with the involvement of the component C3. 

The classical pathway requires antibody binding to infectious agent antigens. The antibodies 
3 5 serve to define the target and initiate the complement system cascade, culminating in the destruction 
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of the infectious agent. In this pathway, since the antibody guides initiation of the process, the 
complement can be seen as an effector arm of the humoral immune system. 

The alternative pathway of the complement system does not require the presence of pre- 
existing antibodies for targeting infectious agent destruction. Rather, this pathway, through low 
5 levels of an activated component, remains constantly primed and provides surveillance in the non- 
immune host to enable targeting and destruction of infectious agents. In this case foreign material 
triggers the cascade, thereby facilitating phagocytosis or lysis (Paul, supra , pp.9 18-9 19). 

Another important component of host defense is the process of inflammation. Inflammatory 
responses are divided into four categories on the basis of pathology and include allergic 
10 inflammation, cytotoxic antibody mediated inflammation, immune complex mediated inflammation 
and monocyte mediated inflammation. Inflammation manifests as a combination of each of these 
forms with one predominating. 

Allergic acute inflammation is observed in individuals wherein specific antigens stimulate 
IgE antibody production. Mast cells and basophils are subsequently activated by the attachment of 
15 antigen- IgE complexes, resulting in the release of cytoplasmic granule contents such as histamine. 
The products of activated mast cells can increase vascular permeability and constrict the smooth 
muscle of breathing passages, resulting in anaphylaxis or asthma. Acute inflammation is also 
mediated by cytotoxic antibodies and can result in the destruction of tissue through the binding of 
complement-fixing antibodies to cells. The responsible antibodies are of the IgG or IgM types. 

2 o Resultant clinical disorders include autoimmune hemolytic anemia and thrombocytopenia as 

associated with systemic lupus erythematosis. 

Immune complex mediated acute inflammation involves the IgG or IgM antibody types 
which combine with antigen to activate the complement cascade. When such immune complexes 
bind to neutrophils and macrophages they activate the respiratory burst to form protein- and vessel- 
25 damaging agents such as hydrogen peroxide, hydroxyl radical, hypochlorous acid, and chloramines. 
Clinical manifestations include rheumatoid arthritis and systemic lupus erythematosus. 

In chronic inflammation or delayed-type hypersensitivity, macrophages are activated and 
process antigen for presentation to T cells that subsequently produce lymphokines and monokines. 
This type of inflammatory response is likely important for defense against intracellular parasites and 

3 0 certain viruses. Clinical associations include, granulomatous disease, tuberculosis, leprosy, and 

sarcoidosis (Paul, W.E., supra , pp.1017- 1018). 



Extracellular Information Transmission Molecules 

SEQ ID NO:9 encodes, for example, an extracellular information transmission molecule. 
3 5 Intercellular communication is essential for the growth and survival of multicellular 
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organisms, and in particular, for the function of the endocrine, nervous, and immune systems. In 
addition, intercellular communication is critical for developmental processes such as tissue 
construction and organogenesis, in which cell proliferation, cell differentiation, and morphogenesis 
must be spatially and temporally regulated in a precise and coordinated manner. Cells communicate 
5 with one another through the secretion and uptake of diverse types of signaling molecules such as 
hormones, growth factors, neuropeptides, and cytokines. 
Hormones 

Hormones are signaling molecules that coordinately regulate basic physiological processes 
from embryogenesis throughout adulthood. These processes include metabolism, respiration, 

10 reproduction, excretion, fetal tissue differentiation and organogenesis, growth and development, 
homeostasis, and the stress response. Hormonal secretions and the nervous system are tightly 
integrated and interdependent. Hormones are secreted by endocrine glands, primarily the hypothalamus 
and pituitary, the thyroid and parathyroid, the pancreas, the adrenal glands, and the ovaries and testes. 
The secretion of hormones into the circulation is tightly controlled. Hormones are often 

1 5 secreted in diurnal, pulsatile, and cyclic patterns. Hormone secretion is regulated by perturbations in 
blood biochemistry, by other upstream-acting hormones, by neural impulses, and by negative feedback 
loops. Blood hormone concentrations are constantly monitored and adjusted to maintain optimal, 
steady-state levels. Once secreted, hormones act only on those target cells that express specific 
receptors. 

20 Most disorders of the endocrine system are caused by either hyposecretion or hypersecretion of 

hormones. Hyposecretion often occurs when a hormone's gland of origin is damaged or otherwise 
impaired. Hypersecretion often results from the proliferation of tumors derived from hormone-secreting 
cells. Inappropriate hormone levels may also be caused by defects in regulatory feedback loops or in 
the processing of hormone precursors. Endocrine malfunction may also occur when the target cell fails 

25 to respond to the hormone. 

Hormones can be classified biochemically as polypeptides, steroids, eicosanoids, or amines. 
Polypeptides, which include diverse hormones such as insulin and growth hormone, vary in size and 
function and are often synthesized as inactive precursors that are processed intracellularly into mature, 
active forms. Amines, which include epinephrine and dopamine, are amino acid derivatives that 

3 o function in neuroendocrine signaling. Steroids, which include the cholesterol-derived hormones 
estrogen and testosterone, function in sexual development and reproduction. Eicosanoids, which 
include prostaglandins and prostacyclins, are fatty acid derivatives that function in a variety of 
processes. Most polypeptides and some amines are soluble in the circulation where they are highly 
susceptible to proteolytic degradation within seconds after their secretion. Steroids and lipids are 
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insoluble and must be transported in the circulation by carrier proteins. The following discussion will 
focus primarily on polypeptide hormones. 

Hormones secreted by the hypothalamus and pituitary gland play a critical role in endocrine 
function by coordinately regulating hormonal secretions from other endocrine glands in response to 
5 neural signals. Hypothalamic hormones include thyrotropin-releasing hormone, gonadotropin-releasing 
hormone, somatostatin, growth-hormone releasing factor, corticotropin-releasing hormone, substance P, 
dopamine, and prolactin-releasing hormone. These hormones directly regulate the secretion of 
hormones from the anterior lobe of the pituitary. Hormones secreted by the anterior pituitary include 
adrenocorticotropic hormone (ACTH), melanocyte-stimulating hormone, somatotropic hormones such 

10 as growth hormone and prolactin, glycoprotein hormones such as thyroid-stimulating hormone, 

luteinizing hormone (LH), and follicle-stimulating hormone (FSH), P-lipotropin, and P-endorphins. 
These hormones regulate hormonal secretions from the thyroid, pancreas, and adrenal glands, and act 
directly on the reproductive organs to stimulate ovulation and spermatogenesis. The posterior pituitary 
synthesizes and secretes antidiuretic hormone (ADH, vasopressin) and oxytocin. 

15 Disorders of the hypothalamus and pituitary often result from lesions such as primary brain 

tumors, adenomas, infarction associated with pregnancy, hypophyseetomy, aneurysms, vascular 
malformations, thrombosis, infections, immunological disorders, and complications due to head trauma. 
Such disorders have profound effects on the function of other endocrine glands. Disorders associated 
with hypopituitarism include hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, 

20 Hand-Schuller-Christian disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and 

dwarfism. Disorders associated with hyperpituitarism include acromegaly, giantism, and syndrome of 
inappropriate ADH secretion (SIADH), often caused by benign adenomas. 

Hormones secreted by the thyroid and parathyroid primarily control metabolic rates and the 
regulation of serum calcium levels, respectively. Thyroid hormones include calcitonin, somatostatin, 

25 and thyroid hormone. The parathyroid secretes parathyroid hormone. Disorders associated with 
hypothyroidism include goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and 
cretinism. Disorders associated with hyperthyroidism include thyrotoxicosis and its various forms, 
Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's 

3 o disease. Disorders associated with hypoparathyroidism include Conn disease (chronic hypercalemia) 
leading to bone resorption and parathyroid hyperplasia. 

Hormones secreted by the pancreas regulate blood glucose levels by modulating the rates of 
carbohydrate, fat, and protein metabolism. Pancreatic hormones include insulin, glucagon, amylin, y- 
aminobutyric acid, gastrin, somatostatin, and pancreatic polypeptide. The principal disorder associated 
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with pancreatic dysfunction is diabetes mellitus caused by insufficient insulin activity. Diabetes 
mellitus is generally classified as either Type I (insulin-dependent, juvenile diabetes) or Type II (non- 
insulin-dependent, adult diabetes). The treatment of both forms by insulin replacement therapy is well 
known. Diabetes mellitus often leads to acute complications such as hypoglycemia (insulin shock), 
5 coma, diabetic ketoacidosis, lactic acidosis, and chronic complications leading to disorders of the eye, 
kidney, skin, bone, joint, cardiovascular system, nervous system, and to decreased resistance to 
infection. 

The anatomy, physiology, and diseases related to hormonal function are reviewed in McCance, 
K.L. and S.E. Huether (1994) Pathophysiology: The Biological Basis for Disease in Adults and 
10 Children . Mosby-Year Book, Inc., St. Louis MO; Greenspan, F.S. and J.D. Baxter (1994) Basic and 
Clinical Endocrinology , Appleton and Lange, East Norwalk CT. 
Growth Factors 

Growth factors are secreted proteins that mediate intercellular communication. Unlike 
hormones, which travel great distances via the circulatory system, most growth factors are primarily 

15 local mediators that act on neighboring cells. Most growth factors contain a hydrophobic N-terminal 
signal peptide sequence which directs the growth factor into the secretory pathway. Most growth 
factors also undergo post-translational modifications within the secretory pathway. These 
modifications can include proteolysis, glycosylation, phosphorylation, and intramolecular disulfide bond 
formation. Once secreted, growth factors bind to specific receptors on the surfaces of neighboring 

2 o target cells, and the bound receptors trigger intracellular signal transduction pathways. These signal 
transduction pathways elicit specific cellular responses in the target cells. These responses can include 
the modulation of gene expression and the stimulation or inhibition of cell division, cell differentiation, 
and cell motility. 

Growth factors fall into at least two broad and overlapping classes. The broadest class 
2 5 includes the large polypeptide growth factors, which are wide-ranging in their effects. These factors 
include epidermal growth factor (EGF), fibroblast growth factor (FGF), transforming growth factor-p 
(TGF-[J), insulin-like growth factor (IGF), nerve growth factor (NGF), and platelet-derived growth 
factor (PDGF), each defining a family of numerous related factors. The large polypeptide growth 
factors, with the exception of NGF, act as mitogens on diverse cell types to stimulate wound healing, 
30 bone synthesis and remodeling, extracellular matrix synthesis, and proliferation of epithelial, 

epidermal, and connective tissues. Members of the TGF-P, EGF, and FGF families also function as 
inductive signals in the differentiation of embryonic tissue. NGF functions specifically as a 
neurotrophic factor, promoting neuronal growth and differentiation. 

Another class of growth factors includes the hematopoietic growth factors, which are narrow in 
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their target specificity. These factors stimulate the proliferation and differentiation of blood cells such 
as B-lymphocytes, T-lymphocytes, erythrocytes, platelets, eosinophils, basophils, neutrophils, 
macrophages, and their stem cell precursors. These factors include the colony-stimulating factors (G- 
CSF, M-CSF, GM-CSF, and CSF1-3), erythropoietin, and the cytokines. The cytokines are specialized 
5 hematopoietic factors secreted by cells of the immune system and are discussed in detail below. 

Growth factors play critical roles in neoplastic transformation of cells in vitro and in tumor 
progression in vivo . Overexpression of the large polypeptide growth factors promotes the 
proliferation and transformation of cells in culture. Inappropriate expression of these growth factors 
by tumor cells in vivo may contribute to tumor vascularization and metastasis. Inappropriate activity 

10 of hematopoietic growth factors can result in anemias, leukemias, and lymphomas. Moreover, growth 
factors are both structurally and functionally related to oncoproteins, the potentially cancer-causing 
products of proto-oncogenes. Certain FGF and PDGF family members are themselves homologous to 
oncoproteins, whereas receptors for some members of the EGF, NGF, and FGF families are encoded 
by proto-oncogenes. Growth factors also affect the transcriptional regulation of both proto-oncogenes 

15 and oncosuppressor genes (Pimentel, E. (1994) Handbook of Growth Factors . CRC Press, Ann Arbor 
MI; McKay, I. and L Leigh, eds. (1993) Growth Factors: A Practical Approach . Oxford University 
Press, New York NY; Habenicht, A., ed. (1990) Growth Factors. Differentiation Factors, and 
Cytokines . Springer- Verlag, New York NY). 

In addition, some of the large polypeptide growth factors play crucial roles in the induction of 

20 the primordial germ layers in the developing embryo. This induction ultimately results in the formation 
of the embryonic mesoderm, ectoderm, and endoderm which in turn provide the framework for the 
entire adult body plan. Disruption of this inductive process would be catastrophic to embryonic 
development. 

Small Peptide Factors - Neuropeptides and Vasomediators 

2 5 Neuropeptides and vasomediators (NP/VM) comprise a family of small peptide factors, 

typically of 20 amino acids or less. These factors generally function in neuronal excitation and 
inhibition of vasoconstriction/vasodilation, muscle contraction, and hormonal secretions from the 
brain and other endocrine tissues. Included in this family are neuropeptides and neuropeptide 
hormones such as bombesin, neuropeptide Y, neurotensin, neuromedin N, melanocortins, opioids, 

3 o galanin, somatostatin, tachykinins, urotensin II and related peptides involved in smooth muscle 

stimulation, vasopressin, vasoactive intestinal peptide, and circulatory system-borne signaling 
molecules such as angiotensin, complement, calcitonin, endothelins, formyl-methionyl peptides, 
glucagon, cholecystokinin, gastrin, and many of the peptide hormones discussed above. NP/VMs can 
transduce signals directly, modulate the activity or release of other neurotransmitters and hormones, and 
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act as catalytic enzymes in signaling cascades. The effects of NP/VMs range from extremely brief to 
long-lasting. (Reviewed in Martin, C.R. et al. (1985) Endocrine Physiology, Oxford University Press, 
New York NY, pp. 57-62.) 
Cytokines 

5 Cytokines comprise a family of signaling molecules that modulate the immune system and the 

inflammatory response. Cytokines are usually secreted by leukocytes, or white blood cells, in response 
to injury or infection. Cytokines function as growth and differentiation factors that act primarily on 
cells of the immune system such as B- and T-lymphocytes, monocytes, macrophages, and granulocytes. 
Like other signaling molecules, cytokines bind to specific plasma membrane receptors and trigger 

10 intracellular signal transduction pathways which alter gene expression patterns. There is considerable 
potential for the use of cytokines in the treatment of inflammation and immune system disorders. 

Cytokine structure and function have been extensively characterized in vitro . Most cytokines 
are small polypeptides of about 30 kilodaltons or less. Over 50 cytokines have been identified from 
human and rodent sources. Examples of cytokine subfamilies include the interferons (IFN~a, -p, and - 

15 y)» the interleukins (IL1-IL13), the tumor necrosis factors (TNF-oc and -P), and the chemokines. Many 
cytokines have been produced using recombinant DNA techniques, and the activities of individual 
cytokines have been determined in vitro . These activities include regulation of leukocyte proliferation, 
differentiation, and motility. 

The activity of an individual cytokine in vitro may not reflect the full scope of that cytokine's 

2 o activity in vivo . Cytokines are not expressed individually in vivo but are instead expressed in 

combination with a multitude of other cytokines when the organism is challenged with a stimulus. 
Together, these cytokines collectively modulate the immune response in a manner appropriate for that 
particular stimulus. Therefore, the physiological activity of a cytokine is determined by the stimulus 
itself and by complex interactive networks among co-expressed cytokines which may demonstrate both 

2 5 synergistic and antagonistic relationships. 

Chemokines comprise a cytokine subfamily with over 30 members. (Reviewed in Wells, T. 
N.C. and M.C. Peitsch (1997) J. Leukoc. Biol. 61:545-550.) Chemokines were initially identified as 
chemotactic proteins that recruit monocytes and macrophages to sites of inflammation. Recent evidence 
indicates that chemokines may also play key roles in hematopoiesis and HTV-1 infection. Chemokines 

30 are small proteins which range from about 6-15 kilodaltons in molecular weight. Chemokines are 
further classified as C, CC, CXC, or CX 3 C based on the number and position of critical cysteine 
residues. The CC chemokines, for example, each contain a conserved motif consisting of two 
consecutive cysteines followed by two additional cysteines which occur downstream at 24- and 16- 
residue intervals, respectively (ExPASy PROSITE database, documents PS00472 and PDOC00434). 
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The presence and spacing of these four cysteine residues are highly conserved, whereas the intervening 
residues diverge significantly. However, a conserved tyrosine located about 15 residues downstream of 
the cysteine doublet seems to be important for chemotactic activity. Most of the human genes encoding 
CC chemokines are clustered on chromosome 17, although there are a few examples of CC chemokine 
5 genes that map elsewhere. Other chemokines include lymphotactin (C chemokine); macrophage 
chemotactic and activating factor (MCAF/MCP-1; CC chemokine); platelet factor 4 and EL-8 (CXC 
chemokines); and fractalkine and neurotractin (CX 3 C chemokines). (Reviewed in Luster, A.D. (1998) 
N. Engl. J. Med. 338:436-445.) 

10 Receptor Molecules 

SEQ ID NO:10 and SEQ ID NO:l 1 encode, for example, receptor molecules. 
The term receptor describes proteins that specifically recognize other molecules. The category 
is broad and includes proteins with a variety of functions. The bulk of receptors are cell surface 
proteins which bind extracellular ligands and produce cellular responses in the areas of growth, 

15 differentiation, endocytosis, and immune response. Other receptors facilitate the selective transport of 
proteins out of the endoplasmic reticulum and localize enzymes to particular locations in the cell. The 
term may also be applied to proteins which act as receptors for ligands with known or unknown 
chemical composition and which interact with other cellular components. For example, the steroid 
hormone receptors bind to and regulate transcription of DNA. 

20 Regulation of cell proliferation, cUfferentiation, and migration is important for the formation 

and function of tissues. Regulatory proteins such as growth factors coordinately control these cellular 
processes and act as mediators in cell-cell signaling pathways. Growth factors are secreted proteins 
that bind to specific cell-surface receptors on target cells. The bound receptors trigger intracellular 
signal transduction pathways which activate various downstream effectors that regulate gene 

25 expression, cell division, cell differentiation, cell motility, and other cellular processes. 

Cell surface receptors are typically integral plasma membrane proteins. These receptors 
recognize hormones such as catecholamines; peptide hormones; growth and differentiation factors; 
small peptide factors such as thyrotropin-releasing hormone; galanin, somatostatin, and tachykinins; 
and circulatory system-borne signaling molecules. Cell surface receptors on immune system cells 

20 recognize antigens, antibodies, and major histocompatibility complex (MHC)-bound peptides. Other 
cell surface receptors bind ligands to be internalized by the cell. This receptor-mediated endocytosis 
functions in the uptake of low density lipoproteins (LDL), transferrin, glucose- or mannose-terminal 
glycoproteins, galactose-terminal glycoproteins, immunoglobulins, phosphovitellogenins, fibrin, 
proteinase-inlubitor complexes, plasminogen activators, and thrombospondin (Lodish, H. et al. (1995) 
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Molecular Cell Biology , Scientific American Books, New York NY, p. 723; Mikhailenko, L et al. 
(1997) J. Biol. Chem. 272:6784-6791). 
Receptor Protein Kinases 

Many growth factor receptors, including receptors for epidermal growth factor, 
5 platelet-derived growth factor, fibroblast growth factor, as well as the growth modulator a-thrombin, 
contain intrinsic protein kinase activities. When growth factor binds to the receptor, it triggers the 
autophosphorylation of a serine, threonine, or tyrosine residue on the receptor. These phosphorylated 
sites are recognition sites for the binding of other cytoplasmic signaling proteins. These proteins 
participate in signaling pathways that eventually link the initial receptor activation at the cell surface to 

10 the activation of a specific intracellular target molecule. In the case of tyrosine residue 

autophosphorylation, these signaling proteins contain a common domain referred to as a Src homology 
(SH) domain. SH2 domains and SH3 domains are found in phospholipase C-y, P1-3-K p85 regulatory 
subunit, Ras-GTPase activating protein, and pp6(F src (Lowenstein, E.J. et al. (1992) Cell 70:431-442). 
The cytokine family of receptors share a different common binding domain and include transmembrane 

15 receptors for growth hormone (GH), interleukins, erythropoietin, and prolactin. 

Other receptors and second messenger-binding proteins have intrinsic serine/threonine protein 
kinase activity. These include activin/TGF-p/BMP-superfamily receptors, calcium- and diacylglycerol- 
activated/phospholipid-dependant protein kinase (PK-C), and RNA-dependant protein kinase (PK-R). 
In addition, other serine/threonine protein kinases, including nematode Twitchin, have fibronectin-like, 

2 o immunoglobulin C2-like domains. 

G-Protein Coupled Receptors 

G-protein coupled receptors (GPCRs) are integral membrane proteins characterized by the 
presence of seven hydrophobic transmembrane domains which span the plasma membrane and form a 
bundle of antiparallel alpha (a) helices. These proteins range in size from under 400 to over 1000 
25 amino acids (Strosberg, A.D. (1991) Eur. J. Biochem. 196:1-10; Coughlin, S.R. (1994) Curr. Opin. 
Cell Biol. 6:191-197). The amino-terminus of the GPCR is extracellular, of variable length and often 
glycosylated; the carboxy-terminus is cytoplasmic and generally phosphorylated. Extracellular loops of 
the GPCR alternate with intracellular loops and link the transmembrane domains. The most conserved 
domains of GPCRs are the transmembrane domains and the first two cytoplasmic loops. The 

3 o transmembrane domains account for structural and functional features of the receptor. In most cases, 

the bundle of a helices forms a binding pocket. In addition, the extracellular N-terminal segment or one 
or more of the three extracellular loops may also participate in ligand binding. Ligand binding activates 
the receptor by inducing a conformational change in intracellular portions of the receptor. The 
activated receptor, in turn, interacts with an intracellular heterotrimeric guanine nucleotide binding (G) 
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protein complex which mediates further intracellular signaling activities, generally the production of 
second messengers such as cyclic AMP (cAMP), phospholipase C, inositol triphosphate, or interactions 
with ion channel proteins (Baldwin, J.M. (1994) Curr. Opin. Cell Biol. 6:180-190). 

GPCRs include those for acetylcholine, adenosine, epinephrine and norepinephrine, bombesin, 
5 bradykinin, chemokines, dopamine, endothelin, y-aminobutyric acid (GABA), follicle-stimulating 
hormone (FSH), glutamate, gonadotropin-releasing hormone (GnRH), hepatocyte growth factor, 
histamine, leukotrienes, melanocortins, neuropeptide Y, opioid peptides, opsins, prostanoids, serotonin, 
somatostatin, tachykinins, thrombin, thyrotropin-releasing hormone (TRH), vasoactive intestinal 
polypeptide family, vasopressin and oxytocin, and orphan receptors. 

10 GPCR mutations, which may cause loss of function or constitutive activation, have been 

associated with numerous human diseases (CougWin, supra) . For instance, retinitis pigmentosa may 
arise from mutations in the rhodopsin gene. Rhodopsin is the retinal photoreceptor which is located 
within the discs of the eye rod cell. Parma, J. et al. (1993, Nature 365:649-651) report that somatic 
activating mutations in the thyrotropin receptor cause hyperfunctioning thyroid adenomas and suggest 

1 5 that certain GPCRs susceptible to constitutive activation may behave as protooncogenes. 
Nuclear Receptors 

Nuclear receptors bind small molecules such as hormones or second messengers, leading to 
increased receptor-binding affinity to specific chromosomal DNA elements. In addition the affinity for 
other nuclear proteins may also be altered. Such binding and protein-protein interactions may regulate 
20 and modulate gene expression. Examples of such receptors include the steroid hormone receptors 
family, the retinoic acid receptors family, and the thyroid hormone receptors family. 
Ligand-Gated Receptor Ion Channels 

Ligand-gated receptor ion channels fall into two categories. The first category, extracellular 
ligand-gated receptor ion channels (ELGs), rapidly transduce neurotransmitter-binding events into 

2 5 electrical signals, such as fast synaptic neurotransmission. ELG function is regulated by post- 

translational modification. The second category, intracellular ligand-gated receptor ion channels 
(ILGs), are activated by many intracellular second messengers and do not require post-translational 
modifications) to effect a channel-opening response. 

ELGs depolarize excitable cells to the threshold of action potential generation. In non-excitable 

3 o cells, ELGs permit a limited calcium ion-influx during the presence of agonist. ELGs include channels 

directly gated by neurotransmitters such as acetylcholine, L-glutamate, glycine, ATP, serotonin, 
GABA, and histamine. ELG genes encode proteins having strong structural and functional similarities. 
ILGs are encoded by distinct and unrelated gene families and include receptors for cAMP, cGMP, 
calcium ions, ATP, and metabolites of arachidonic acid. 
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Macrophage Scavenge Receptors 

Macrophage scavenger receptors with broad ligand specificity may participate in the binding of 
low density lipoproteins (LDL) and foreign antigens. Scavenger receptors types I and II are trimeric 
membrane proteins with each subunit containing a small N-terminal intracellular domain, a 
5 transmembrane domain, a large extracellular domain, and a C-terminal cysteine-rich domain. The 

extracellular domain contains a short spacer domain, an a-helical coiled-coil domain, and a triple helical 
collagenous domain. These receptors have been shown to bind a spectrum of ligands, including 
chemically modified lipoproteins and albumin, polyribonucleotides, polysaccharides, phospholipids, and 
asbestos (Matsumoto, A. et al. (1990) Proc. Natl. Acad. Sci. USA 87:9133-9137; Elomaa, O. et al. 
10 (1995) Cell 80:603-609). The scavenger receptors are thought to play a key role in atherogenesis by 
mediating uptake of modified LDL in arterial walls, and in host defense by binding bacterial 
endotoxins, bacteria, and protozoa. 
T-Cell Receptors 

T cells play a dual role in the immune system as effectors and regulators, coupling antigen 
1 5 recognition with the transmission of signals that induce cell death in infected cells and stimulate 

proliferation of other immune cells. Although a population of T cells can recognize a wide range of 
different antigens, an individual T cell can only recognize a single antigen and only when it is presented 
to the T cell receptor (TCR) as a peptide complexed with a major histocompatibility molecule (MHC) 
on the surface of an antigen presenting cell. The TCR on most T cells consists of immunoglobulin-like 
20 integral membrane glycoproteins containing two polypeptide subunits, a and p, of similar molecular 
weight. Both TCR subunits have an extracellular domain containing both variable and constant 
regions, a transmembrane domain that traverses the membrane once, and a short intracellular domain 
(Saito, H. et al. (1984) Nature 309:757-762). The genes for the TCR subunits are constructed through 
somatic rearrangement of different gene segments. Interaction of antigen in the proper MHC context 
2 5 with the TCR initiates signaling cascades that induce the proliferation, maturation, and function of 
cellular components of the immune system (Weiss, A. (1991) Annu. Rev. Genet. 25:487-510). 
Rearrangements in TCR genes and alterations in TCR expression have been noted in lymphomas, 
leukemias, autoimmune disorders, and immunodeficiency disorders (Aisenberg, A.C. et al. (1985) N. 
Engl. J. Med. 313:529-533; Weiss, supra) . 

30 

Intracellular Signaling Molecules 

SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID 
NO: 17, and SEQ ID NO: 18 encode, for example, intracellular signaling molecules. 

Intracellular signaling is the general process by which cells respond to extracellular signals 
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(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of 
biochemical reactions that begins with the binding of a signaling molecule to a cell membrane 
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in the 
process involve the activation of various cytoplasmic proteins by phosphorylation via protein kinases, 
5 and their deactivation by protein phosphatases, and the eventual translocation of some of these 
activated proteins to the cell nucleus where the transcription of specific genes is triggered. The 
intracellular signaling process regulates all types of cell functions including cell proliferation, cell 
differentiation, and gene transcription, and involves a diversity of molecules including protein kinases 
and phosphatases, and second messenger molecules, such as cyclic nucleotides, calcium-calmodulin, 

10 inositol, and various mitogens, that regulate protein phosphorylation. 
Protein Phosphorylation 

Protein kinases and phosphatases play a key role in the intracellular signaling process by 
controlling the phosphorylation and activation of various signaling proteins. The high energy 
phosphate for this reaction is generally transferred from the adenosine triphosphate molecule (ATP) to 

15 a particular protein by a protein kinase and removed from that protein by a protein phosphatase. 
Protein kinases are roughly divided into two groups: those that phosphorylate tyrosine residues 
(protein tyrosine kinases, PTK) and those that phosphorylate serine or threonine residues 
(serine/threonine kinases, STK). A few protein kinases have dual specificity for serine/threonine and 
tyrosine residues. Almost all kinases contain a conserved 250-300 amino acid catalytic domain 

20 containing specific residues and sequence motifs characteristic of the kinase family (Hardie, G. and S. 
Hanks (1995) The Protein Kinase Facts Books . Vol 1:7-20, Academic Press, San Diego CA). 

STKs include the second messenger dependent protein kinases such as the cyclic- AMP 
dependent protein kinases (PKA), involved in mediating hormone-induced cellular responses; 
calcium-calmodulin (CaM) dependent protein kinases, involved in regulation of smooth muscle 

25 contraction, glycogen breakdown, and neurotransmission; and the mitogen-activated protein kinases 
(MAP) which mediate signal transduction from the cell surface to the nucleus via phosphorylation 
cascades. Altered PKA expression is implicated in a variety of disorders and diseases including 
cancer, thyroid disorders, diabetes, atherosclerosis, and cardiovascular disease (Isselbacher, K.J. et al. 
(1994) Harrison's Principles of Internal Medicine McGraw-Hill, New York NY, pp. 416-431, 1887). 

3 o PTKs are divided into transmembrane, receptor PTKs and nontransmembrane, non-receptor 

PTKs. Transmembrane PTKs are receptors for most growth factors. Non-receptor PTKs lack 
transmembrane regions and, instead, form complexes with the intracellular regions of cell surface 
receptors. Receptors that function through non-receptor PTKs include those for cytokines and 
hormones (growth hormone and prolactin) and antigen-specific receptors on T and B lymphocytes. 

3 5 Many of these PTKs were first identified as the products of mutant oncogenes in cancer cells in which 
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their activation was no longer subject to normal cellular controls. In fact, about one third of the 
known oncogenes encode PTKs, and it is well known that cellular transformation (oncogenesis) is 
often accompanied by increased tyrosine phosphorylation activity (Charbonneau, H. and N.K Tonks 
(1992) Annu. Rev. Cell Biol. 8:463-493). 
5 An additional family of protein kinases previously thought to exist only in procaryotes is the 

histidine protein kinase family (HPK). HPKs bear little homology with mammalian STKs or PTKs 
but have distinctive sequence motifs of their own (Davie, J.R. et al. (1995) J. Biol. Chem. 
270:19861-19867). A histidine residue in the N-terminal half of the molecule (region I) is an 
autophosphorylation site. Three additional motifs located in the C-terminal half of the molecule 

10 include an invariant asparagine residue in region II and two glycine-rich loops characteristic of 
nucleotide binding domains in regions III and IV. Recently a branched chain alpha-ketoacid 
dehydrogenase kinase has been found with characteristics of HPK in rat (Davie, supra) . 

Protein phosphatases regulate the effects of protein kinases by removing phosphate groups 
from molecules previously activated by kinases. The two principal categories of protein phosphatases 

15 are the protein (serine/threonine) phosphatases (PPs) and the protein tyrosine phosphatases (PTPs). 
PPs dephosphorylate phosphoserine/threonine residues and are important regulators of many 
cAMP-mediated hormone responses (Cohen, P. (1989) Annu. Rev. Biochem. 58:453-508). PTPs 
reverse the effects of protein tyrosine kinases and play a significant role in cell cycle and cell 
signaling processes (Charbonneau, supra) . As previously noted, many PTKs are encoded by 

20 oncogenes, and oncogenesis is often accompanied by increased tyrosine phosphorylation activity. It 
is therefore possible that PTPs may prevent or reverse cell transformation and the growth of various 
cancers by controlling the levels of tyrosine phosphorylation in cells. This hypothesis is supported by 
studies showing that overexpression of PTPs can suppress transformation in cells, and that specific 
inhibition of PTPs can enhance cell transformation (Charbonneau, supra) . 

25 Phospholipid and Inositol-Phosphate Signaling 

Inositol phospholipids (phosphoinositides) are involved in an intracellular signaling pathway 
that begins with binding of a signaling molecule to a G-protein linked receptor in the plasma 
membrane. This leads to the phosphorylation of phosphatidylinositol (PI) residues on the inner side 
of the plasma membrane to the biphosphate state (PIP^ by inositol kinases. Simultaneously, the G- 

30 protein linked receptor binding stimulates a trimeric G-protein which in turn activates a 

phosphoinositide-specific phospholipase C-p. Phospholipase C-(l then cleaves PIP 2 into two 
products, inositol triphosphate (IP 3 ) and diacylglycerol. TTiese two products act as mediators for 
separate signaling events. IP 3 diffuses through the plasma membrane to induce calcium release from 
the endoplasmic reticulum (ER), while diacylglycerol remains in the membrane and helps activate 

3 5 protein kinase C, an STK that phosphorylates selected proteins in the target cell. The calcium 
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response initiated by IP 3 is terminated by the dephosphorylation of BP 3 by specific inositol 
phosphatases. Cellular responses that are mediated by this pathway are glycogen breakdown in the 
liver in response to vasopressin, smooth muscle contraction in response to acetylcholine, and 
thrombin-induced platelet aggregation. 
* 5 Cyclic Nucleotide Signaling 

Cyclic nucleotides (cAMP and cGMP) function as intracellular second messengers to 
transduce a variety of extracellular signals including hormones, light, and neurotransmitters. In 
particular, cyclic- AMP dependent protein kinases (PKA) are thought to account for all of the effects 
of cAMP in most mammalian cells, including various hormone-induced cellular responses. Visual 

10 excitation and the phototransmission of light signals in the eye is controlled by cyclic-GMP regulated, 
Ca 2+ -specific channels. Because of the importance of cellular levels of cyclic nucleotides in 
mediating these various responses, regulating the synthesis and breakdown of cyclic nucleotides is an 
important matter. Thus adenylyl cyclase, which synthesizes cAMP from AMP, is activated to 
increase cAMP levels in muscle by binding of adrenaline to p-andrenergic receptors, while activation 

15 of guanylate cyclase and increased cGMP levels in photoreceptors leads to reopening of the 

Ca 2+ -specific channels and recovery of the dark state in the eye. In contrast, hydrolysis of cyclic 
nucleotides by cAMP and cGMP-specific phosphodiesterases (PDEs) produces the opposite of these 
and other effects mediated by increased cyclic nucleotide levels. PDEs appear to be particularly 
important in the regulation of cyclic nucleotides, considering the diversity found in this family of 

20 proteins. At least seven families of mammalian PDEs (PDE1-7) have been identified based on 

substrate specificity and affinity, sensitivity to cofactors, and sensitivity to inhibitory drugs (Beavo, 
J.A. (1995) Physiological Reviews 75:725-48). PDE inhibitors have been found to be particularly 
useful in treating various clinical disorders. Rolipram, a specific inhibitor of PDE4, has been used in 
the treatment of depression, and similar inhibitors are undergoing evaluation as anti-inflammatory 

2 5 agents. Theophylline is a nonspecific PDE inhibitor used in the treatment of bronchial asthma and 

other respiratory diseases (Banner, K.H. and CP. Page (1995) Eur. Respir. J. 8:996-1000). 
G-Protein Signaling 

Guanine nucleotide binding proteins (G-proteins) are critical mediators of signal transduction 
between a particular class of extracellular receptors, the G-protein coupled receptors (GPCR), and 
30 intracellular second messengers such as cAMP and Ca 2+ . G-proteins are linked to the cytosolic side 
of a GPCR such that activation of the GPCR by ligand binding stimulates binding of the G-protein to 
GTP, inducing an "active" state in the G-protein. In the active state, the G-protein acts as a signal to 
trigger other events in the cell such as the increase of cAMP levels or the release of Ca 2 * into the 
cytosol from the ER, which, in turn, regulate phosphorylation and activation of other intracellular 

3 5 proteins. Recycling of the G-protein to the inactive state involves hydrolysis of the bound GTP to 
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GDP by a GTPase activity in the G-protein. (See Alberts, B. et al. (1994) Molecular Biology of the 
Cell, Garland Publishing, Inc., New York NY, pp.734-759.) Two structurally distinct classes of G- 
proteins are recognized: heterotrimeric G-proteins, consisting of three different subunits, and 

s 

monomeric, low molecular weight (LMW), G-proteins consisting of a single polypeptide chain. 
; 5 The three polypeptide subunits of heterotrimeric G-proteins are the a, p, and y subunits. The 

a subunit binds and hydrolyzes GTP. The P and y subunits form a tight complex that anchors the 
protein to the inner side of the plasma membrane. The p subunits, also known as G-p proteins or p 
transducins, contain seven tandem repeats of the WD-repeat sequence motif, a motif found in many 
proteins with regulatory functions. Mutations and variant expression of p transducin proteins are 

10 linked with various disorders (Neer, E.J. et al, (1994) Nature 371 :297-300; Margottin, F. et al. (1998) 
Mol. Cell 1:565-574). 

LMW GTP-proteins are GTPases which regulate cell growth, cell cycle control, protein 
secretion, and intracellular vesicle interaction. They consist of single polypeptides which, like the a 
subunit of the heterotrimeric G-proteins, are able to bind and hydrolyze GTP, thus cycling between an 

15 inactive and an active state. At least sixty members of the LMW G-protein superfamily have been 
identified and are currently grouped into the six subfamilies of ras, rho, arf , sari , ran, and rab. 
Activated ras genes were initially found in human cancers, and subsequent studies confirmed that ras 
function is critical in determining whether cells continue to grow or become differentiated. Other 
members of the LMW G-protein superfamily have roles in signal transduction that vary with the 

20 function of the activated genes and the locations of the G-proteins. 

Guanine nucleotide exchange factors regulate the activities of LMW G-proteins by 
determining whether GTP or GDP is bound. GTPase-activating protein (GAP) binds to GTP-ras and 
induces it to hydrolyze GTP to GDP. In contrast, guanine nucleotide releasing protein (GNRP) binds 
to GDP-ras and induces the release of GDP and the binding of GTP. 

25 Other regulators of G-protein signaling (RGS) also exist that act primarily by negatively 

regulating the G-protein pathway by an unknown mechanism (Druey, K.M. et al. (1996) Nature 
379:742-746). Some 15 members of the RGS family have been identified. RGS family members are 
related structurally through similarities in an approximately 120 amino acid region termed the RGS 
r domain and functionally by their ability to inhibit the interleukin (cytokine) induction of MAP kinase 

30 in cultured mammalian 293T cells (Druey, supra) , 
i Calcium Signaling Molecules 

Ca +2 is another second messenger molecule that is even more widely used as an intracellular 
mediator than cAMP. Two pathways exist by which Ca +2 can enter the cytosol in response to 
extracellular signals: One pathway acts primarily in nerve signal transduction where Ca enters a 
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nerve terminal through a voltage-gated Ca +2 channel. The second is a more ubiquitous pathway in 
which Ca +2 is released from the ER into the cytosol in response to binding of an extracellular 
signaling molecule to a receptor. Ca 2+ directly activates regulatory enzymes, such as protein kinase C, 
which trigger signal transduction pathways. Ca 2+ also binds to specific Ca 2+ -binding proteins (CBPs) 
5 such as calmodulin (CaM) which then activate multiple target proteins in the cell including enzymes, 
membrane transport pumps, and ion channels. CaM interactions are involved in a multitude of cellular 
processes including, but not limited to, gene regulation, DNA synthesis, cell cycle progression, 
mitosis, cytokinesis, cytoskeletal organization, muscle contraction, signal transduction, ion 
homeostasis, exocytosis, and metabolic regulation (Celio, M.R. et al. (1996) Guidebook to 

10 Calcium-binding Proteins , Oxford University Press, Oxford, UK, pp. 15-20). Some CBPs can serve 
as a storage depot for Ca 2+ in an inactive state. Calsequestrin is one such CBP that is expressed in 
isoforms specific to cardiac muscle and skeletal muscle. It is suggested that calsequestrin binds Ca 2+ 
in a rapidly exchangeable state that is released during Ca 2+ -signaling conditions (Celio, M.R. et al. 
(1996) Guidebook to Calcium-binding Proteins , Oxford University Press, New York NY, pp. 222- 

15 224). 
Cvclins 

Cell division is the fundamental process by which all living things grow and reproduce. In 
most organisms, the cell cycle consists of three principle steps; interphase, mitosis, and cytokinesis. 
Interphase, involves preparations for cell division, replication of the DNA and production of essential 

20 proteins. In mitosis, the nuclear material is divided and separates to opposite sides of the cell. 
Cytokinesis is the final division and fission of the cell cytoplasm to produce the daughter cells. 

The entry and exit of a cell from mitosis is regulated by the synthesis and destruction of a 
family of activating proteins called cyclins. Cyclins act by binding to and activating a group of 
cyclin-dependent protein kinases (Cdks) which then phosphorylate and activate selected proteins 

25 involved in the mitotic process. Several types of cyclins exist. (Ciechanover, A. (1994) Cell 

79:13-21.) Two principle types are mitotic cyclin, or cyclin B, which controls entry of the cell into 
mitosis, and Gl cyclin, which controls events that drive the cell out of mitosis. 
Signal Complex Scaffolding Proteins 

Ceretain proteins in intracellular signaling pathways serve to link or cluster other proteins 

3 o involved in the signaling cascade. A conserved protein domain called the PDZ domain has been 
identified in various membrane-associated signaling proteins. This domain has been implicated in 
receptor and ion channel clustering and in the targeting of multiprotein signaling complexes to 
specialized functional regions of the cytosolic face of the plasma membrane. (For a review of PDZ 
domain-containing proteins, see Ponting, CP. et al. (1997) Bioessays 19:469-479.) A large 
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proportion of PDZ domains are found in the eukaryotic MAGUK (membrane-associated guanylate 
kinase) protein family, members of which bind to the intracellular domains of receptors and channels. 
However, PDZ domains are also found in diverse membrane-localized proteins such as protein 
tyrosine phosphatases, serine/threonine kinases, G-protein cofactors, and synapse-associated proteins 
i 5 such as syntrophins and neuronal nitric oxide synthase (nNOS). Generally, about one to three PDZ 

domains are found in a given protein, although up to nine PDZ domains have been identified in a 
single protein. 

Membrane Transport Molecules 

10 The plasma membrane acts as a barrier to most molecules. Transport between the cytoplasm 

and the extracellular environment, and between the cytoplasm and lumenal spaces of cellular 
organelles requires specific transport proteins. Each transport protein carries a particular class of 
molecule, such as ions, sugars, or amino acids, and often is specific to a certain molecular species of 
the class. A variety of human inherited diseases are caused by a mutation in a transport protein. For 

1 5 example, cystinuria is an inherited disease that results from the inability to transport cystine, the 
disulfide-linked dimer of cysteine, from the urine into the blood. Accumulation of cystine in the 
urine leads to the formation of cystine stones in the kidneys. 

Transport proteins are multi-pass transmembrane proteins, which either actively transport 
molecules across the membrane or passively allow them to cross. Active transport involves 

20 directional pumping of a solute across the membrane, usually against an electrochemical gradient 
Active transport is tightly coupled to a source of metabolic energy, such as ATP hydrolysis or an 
electrochemically favorable ion gradient. Passive transport involves the movement of a solute down 
its electrochemical gradient. Transport proteins can be further classified as either carrier proteins or 
channel proteins. Carrier proteins, which can function in active or passive transport, bind to a specific 

2 5 solute to be transported and undergo a conformational change which transfers the bound solute across 

the membrane. Channel proteins, which only function in passive transport, form hydrophilic pores 
across the membrane. When the pores open, specific solutes, such as inorganic ions, pass through the 
membrane and down the electrochemical gradient of the solute. 

Carrier proteins which transport a single solute from one side of the membrane to the other 
30 are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 

r 

opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 

3 5 sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 
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Na7K* ATPase. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1), 
iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have twelve 
putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-oriented N- 
and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of various thyroid 
•i 5 pathologies because it is the molecular basis for radioiodide thyroid-imaging techniques and for 

specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) Proc. Natl. Acad. Sci. 
USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and placenta, and is 
implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate (Prasad, P.D. et 
al. (1998) J. Biol. Chem. 273:7501-7506). 

10 Transporters play a major role in the regulation of pH, excretion of drugs, and the cellular 

K 4 /Na + balance. Monocarboxylate anion transporters are proton-coupled symporters with a broad 
substrate specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are predicted 
to have twelve transmembrane (TM) helical domains with a large intracellular loop between TM6 and 

1 5 TM7, and play a critical role in maintaining intracellular pH by removing the protons that are produced 
stoichiometrically with lactate during glycolysis. The best characterized H(+)-monocarboxylate 
transporter is that of the erythrocyte membrane, which transports L-lactate and a wide range of other 
aliphatic monocarboxylates. Other cells possess H(+)-linked monocarboxylate transporters with 
differing substrate and inhibitor selectivities. In particular, cardiac muscle and tumor cells have 

2 0 transporters that differ in their K m values for certain substrates, including stereoselectivity for L- over 
D-lactate, and in their sensitivity to inhibitors. There are Na(+)-monocarboxylate cotransporters on the 
luminal surface of intestinal and kidney epithelia, which allow the uptake of lactate, pyruvate, and 
ketone bodies in these tissues. In addition, there are specific and selective transporters for organic 
cations and organic anions in organs including the kidney, intestine and liver. Organic anion 

2 5 transporters are selective for hydrophobic, charged molecules with electron-attracting side groups. 

Organic cation transporters, such as the ammonium transporter, mediate the secretion of a variety of 
drugs and endogenous metabolites, and contribute to the maintenance of intercellular pH. (Poole, R.C. 
and A.P. Halestrap (1993) Am. J. Physiol. 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 
329:321-328; and Martinelle, KL and I. Haggstrom (1993) J. Biotechnol. 30: 339-350.) 

3 o The largest and most diverse family of transport proteins known is the ATP-binding cassette 
> (ABC) transporters. As a family, ABC transporters can transport substances that differ markedly in 

chemical structure and size, ranging from small molecules such as ions, sugars, amino acids, peptides, 
and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC proteins 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
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energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the case 
for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD, These "half-molecules" form 
5 homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 

histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects in 
ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia 

10 (sulfonylurea receptor, SUR). Ova-expression of the multidrug resistance (MDR) protein, another 

ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used in 
chemotherapy (Taglight, D. and S. Michaelis (1998) Meth. Enzymol. 292:131-163). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 

1 5 acid transport appears to occur via a high affinity, low capacity protein-mediated transport process. 
Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane segments, 
is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as muscle, heart, 
and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose conversion, and 
expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et al. (1998) J. 

20 Biol. Chem. 273:27420-27429). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form an ion- 
selective pore within the membrane. There are two basic types of ion channels, ion transporters and 

2 5 gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 

transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an 
ion down the ion's electrochemical gradient undo* restricted conditions. Together, these types of ion 
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse 
conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration 

30 gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 
These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ -ATPase, and H + -ATPase, are activated by a 
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phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of Na + and Ca 2+ are low and cytosolic concentration of 
K + is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular organelles, 
such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH within 
5 the lumen of these organelles that is required for function. The coupling factor (F) class consists of H + 
pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from 
ADP and inorganic phosphate (Pi). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 

10 the cell. Amino acid and glucose transport into many cells is linked to sodium ion co- transport 

(symport) so that the movement of Na + down an electrochemical gradient drives transport of the other 
molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of the cell 
with transport of Na + into the cell (antiport). 

Ion channels share common structural and mechanistic themes. The channel consists of four or 

1 5 five subunits or protein monomers that are arranged like a barrel in the plasma membrane. Each 

subunit typically consists of six potential transmembrane segments (SI, S2, S3, S4, S5, and S6). The 
center of the barrel forms a pore lined by a-helices or p-strands. The side chains of the amino acid 
residues comprising the a-helices or p-strands establish the charge (cation or anion) selectivity of the 
channel. The degree of selectivity, or what specific ions are allowed to pass through the channel, 

2 o depends on the diameter of the narrowest part of the pore. 

Gated ion channels control ion flow by regulating the opening and closing of pores. These 
channels are categorized according to the manner of regulating the gating function. Mechanically-gated 
channels open pores in response to mechanical stress, voltage-gated channels open pores in response to 
changes in membrane potential, and ligand-gated channels open pores in the presence of a specific ion, 
25 nucleotide, or neurotransmitter. 

Voltage-gated Na + and K + channels are necessary for the function of electrically excitable cells, 
such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and muscle 
contraction, arise from large, transient changes in the permeability of the membrane to Na + and K + ions. 
Depolarization of the membrane beyond the threshold level opens voltage-gated Na + channels. Sodium 

3 o ions flow into the cell, further depolarizing the membrane and opening more voltage-gated Na + 

channels, which propagates the depolarization down the length of the cell. Depolarization also opens 
voltage-gated potassium channels. Consequently, potassium ions flow outward, which leads to 
repolarization of the membrane. Voltage-gated channels utilize charged residues in the fourth 
transmembrane segment (S4) to sense voltage change. The open state lasts only about 1 millisecond, at 
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which time the channel spontaneously converts into an inactive state that cannot be opened irrespective 
of the membrane potential. Inactivation is mediated by the channel's N-terminus, which acts as a plug 
that closes the pore. The transition from an inactive to a closed state requires a return to resting 
potential. 

5 Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore forming 

a subunit that associates with two smaller auxiliary subunits, pi and P2. The P2 subunit is an integral 
membrane glycoprotein that contains an extracellular Ig domain, and its association with a and pi 
subunits correlates with increased functional expression of the channel, a change in its gating 
properties, and an increase in whole cell capacitance due to an increase in membrane surface area. 

10 (Isom, L.L. et al. (1995) Cell 83:433-442.) 

Voltage-gated Ca 2+ channels are involved in presynaptic neurotransmitter release, and heart 
and skeletal muscle contraction. The voltage-gated Ca 2+ channels from skeletal muscle (L-type) and 
brain (N-type) have been purified, and though their functions differ dramatically, they have similar 
subunit compositions. The channels are composed of three subunits. The subunit forms the 

is membrane pore and voltage sensor, while the c^S and p subunits modulate the voltage-dependence, 

gating properties, and the current amplitude of the channel. These subunits are encoded by at least six 
c^, one OiS, and four p genes. A fourth subunit, y, has been identified in skeletal muscle. (Walker, D. 
et al. (1998) J. Biol. Chem. 273:2361-2367; and Jay, S.D. et al. (1990) Science 248:490-492.) 

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 

2 o organelle pH. In secretory epithelial cells, CI enters the cell across a basolateral membrane through an 
Na + , K7C1" cotransporter, accumulating in the cell above its electrochemical equilibrium concentration. 
Secretion of CI " from the apical surface, in response to hormonal stimulation, leads to flow of Na + and 
water into the secretory lumen. The cystic fibrosis transmembrane conductance regulator (CFTR) is a 
chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. 

2 5 Loss of CFTR function decreases transepithelial water secretion and, as a result, the layers of mucus 

that coat the respiratory tree, pancreatic ducts, and intestine are dehydrated and difficult to clear. The 
resulting blockage of these sites leads to pancreatic insufficiency, "meconium ileus", and devastating 
"chronic obstructive pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266). 

Many intracellular organelles contain H + - ATPase pumps that generate transmembrane pH and 

3 o electrochemical differences by moving protons from the cytosol to the organelle lumen. If the 

membrane of the organelle is permeable to other ions, then the electrochemical gradient can be 
abrogated without affecting the pH differential. In fact, removal of the electrochemical barrier allows 
more H + to be pumped across the membrane, increasing the pH differential. CI " is the sole counterion 
of H + translocation in a number of organelles, including chromaffin granules, Golgi vesicles, 
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lysosomes, and endosomes. Functions that require a low vacuolar pH include uptake of small 
molecules such as biogenic amines in chromaffin granules, processing of vacuolar constituents such as 
pro-hormones by proteolytic enzymes, and protein degradation in lysosomes (Al-Awqati, supra) . 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
5 the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter binds to 
their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle cells. 
There are two types of neurotransmitter-gated channels. Sodium channels open in response to 
excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This opening causes an 
influx of Na + and produces the initial localized depolarization that activates the voltage-gated channels 

1 o and starts the action potential. Chloride channels open in response to inhibitory newotransmitters, such 

as y-aminobutyric acid (GABA) and glycine, leading to hyperpolarization of the membrane and the 
subsequent generation of an action potential. 

Ligand-gated channels can be regulated by intracellular second messengers. Calcium-activated 
K + channels are gated by internal calcium ions. In nerve cells, an influx of calcium during 
15 depolarization opens K + channels to modulate the magnitude of the action potential (Ishi, T.M. et al. 
(1997) Proc. Natl. Acad. Sci. USA 94:11651-11656). Cyclic nucleotide-gated (CNG) channels are 
gated by cytosolic cyclic nucleotides. The best examples of these are the cAMP-gated Na + channels 
involved in olfaction and the cGMP-gated cation channels involved in vision. Both systems involve 
ligand-mediated activation of a G-protein coupled receptor which then alters the level of cyclic 

2 o nucleotide within the cell. 

Ion channels are expressed in a number of tissues where they are implicated in a variety of 
processes. CNG channels, while abundantly expressed in photoreceptor and olfactory sensory cells, are 
also found in kidney, lung, pineal, retinal ganglion cells, testis, aorta, and brain. Calcium-activated K + 
channels may be responsible for the vasodilatory effects of bradykinin in the kidney and for shunting 

2 5 excess K + from brain capillary endothelial cells into the blood. They are also implicated in repolarizing 
granulocytes after agonist-stimulated depolarization (Ishi, supra) . Ion channels have been the target for 
many drug therapies. Neurotransmitter-gated channels have been targeted in therapies for treatment of 
insomnia, anxiety, depression, and schizophrenia. Voltage-gated channels have been targeted in 
therapies for arrhythmia, ischemic stroke, head trauma, and neurodegenerative disease (Taylor, CP. 

30 and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g. cystic fibrosis, glucose-galactose 
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malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (vant Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480). 

Protein Modification and Maintenance Molecules 

SEQ ID NO:34 encodes, for example, a protein modification and maintenance molecule. 

The cellular processes regulating modification and maintenance of protein molecules 
coordinate their conformation, stabilization, and degradation. Each of these processes is mediated by 
key enzymes or proteins such as proteases, protease inhibitors, transferases, isomerases, and 
molecular chaperones. 
Proteases 

Proteases cleave proteins and peptides at the peptide bond that forms the backbone of the 
peptide and protein chain. Proteolytic processing is essential to cell growth, differentiation, 
remodeling, and homeostasis as well as inflammation and immune response. Typical protein half- 
lives range from hours to a few days, so that within all living cells, precursor proteins are being 
cleaved to their active form, signal sequences proteolytically removed from targeted proteins, and 
aged or defective proteins degraded by proteolysis. Proteases function in bacterial, parasitic, and viral 
invasion and replication within a host Four principal categories of mammalian proteases have been 
identified based on active site structure, mechanism of action, and overall three-dimensional structure. 
(Beynon, R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach. Oxford University 
Press, New York NY, pp. 1-5). 

The serine proteases (SPs) have a serine residue, usually within a conserved sequence, in an 
active site composed of the serine, an aspartate, and a histidine residue. SPs include the digestive 
enzymes trypsin and chymotrypsin, components of the complement cascade and the blood-clotting 
cascade, and enzymes that control extracellular protein degradation. The main SP sub-families are 
trypases, which cleave after arginine or lysine; aspartases, which cleave after aspartate; chymases, 
which cleave after phenylalanine or leucine; metases, which cleavage after methionine; and serases 
which cleave after serine. Enterokinase, the initiator of intestinal digestion, is a serine protease found 
in the intestinal brush border, where it cleaves the acidic propeptide from trypsinogen to yield active 
trypsin (Kitamoto, Y. et al. (1994) Proc. Natl. Acad. Sci. USA 91:7588-7592). 
Procarboxypeptidase, a lysosomal serine peptidase that cleaves peptides such as angiotensin II and 
III and [des- Arg9] bradykinin, shares sequence homology with members of both the serine 



71 



WO 01/21836 



PCT/US00/25643 



carboxypeptidase andprolylendopeptidase families (Tan, F. et ah (1993) J. Biol. Chem. 268:16631- 
16638). 

Cysteine proteases (CPs) have a cysteine as the major catalytic residue at an active site where 
catalysis proceeds via an intermediate thiol ester and is facilitated by adjacent histidine and aspartic 
acid residues. CPs are involved in diverse cellular processes ranging from the processing of precursor 
proteins to intracellular degradation. Mammalian CPs include lysosomal cathepsins and cytosolic 
calcium activated proteases, calpains. CPs are produced by monocytes, macrophages and other cells 
of the immune system which migrate to sites of inflammation and secrete molecules involved in 
tissue repair. Overabundance of these repair molecules plays a role in certain disorders. In 
autoimmune diseases such as rheumatoid arthritis, secretion of the cysteine peptidase cathepsin C 
degrades collagen, laminin, elastin and other structural proteins found in the extracellular matrix of 
bones. 

Aspartic proteases are members of the cathepsin family of lysosomal proteases and include 
pepsin A, gastricsin, chymosin, renin, and cathepsins D and E. Aspartic proteases have a pair of 
aspartic acid residues in the active site, and are most active in the pH 2 - 3 range, in which one of the 
aspartate residues is ionized, the other un-ionized. Aspartic proteases include bacterial 
penicillopepsin, mammalian pepsin, renin, chymosin, and certain fungal proteases. Abnormal 
regulation and expression of cathepsins is evident in various inflammatory disease states. In cells 
isolated from inflamed synovia, the mRNA for stromelysin, cytokines, TIMP-1, cathepsin, gelatinase, 
and other molecules is preferentially expressed. Expression of cathepsins L and D is elevated in 
synovial tissues from patients with rheumatoid arthritis and osteoarthritis. Cathepsin L expression may 
also contribute to the influx of mononuclear cells which exacerbates the destruction of the rheumatoid 
synovium. (Keyszer, G.M. (1995) Arthritis Rheum. 38:976-984.) The increased expression and 
differential regulation of the cathepsins are linked to the metastatic potential of a variety of cancers and 
as such are of therapeutic and prognostic interest (Chambers, A.F. et al. (1993) Crit. Rev. Oncog. 
4:95-114). 

Metalloproteases have active sites that include two glutamic acid residues and one histidine 
residue that serve as binding sites for zinc. Carboxypeptidases A and B are the principal mammalian 
metalloproteases. Both are exoproteases of similar structure and active sites. Carboxypeptidase A, 
like chymotrypsin, prefers C-terminal aromatic and aliphatic side chains of hydrophobic nature, 
whereas carboxypeptidase B is directed toward basic arginine and lysine residues. Glycoprotease 
(GCP), or O-sialoglycoprotein endopeptidase, is a metallopeptidase which specifically cleaves 
Osialoglycoproteins such as glycophorin A. Another metallopeptidase, placental leucine 
aminopeptidase (P-LAP) degrades several peptide hormones such as oxytocin and vasopressin, 
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suggesting a role in maintaining homeostasis during pregnancy, and is expressed in several tissues 
(Rogi, T. et al. (1996) J. Biol. Chem. 271:56-61). 

Ubiquitin proteases are associated with the ubiquitin conjugation system (UCS), a major 
pathway for the degradation of cellular proteins in eukaryotic cells and some bacteria. The UCS 
5 mediates the elimination of abnormal proteins and regulates the half-lives of important regulatory 
proteins that control cellular processes such as gene transcription and cell cycle progression. In the 
UCS pathway, proteins targeted for degradation are conjugated to a ubiquitin, a small heat stable 
protein. The ubiquitinated protein is then recognized and degraded by proteasome, a large, 
multisubunit proteolytic enzyme complex, and ubiquitin is released for reutilization by ubiquitin 

10 protease. The UCS is implicated in the degradation of mitotic cyclic kinases, oncoproteins, tumor 
suppressor genes such as p53, viral proteins, cell surface receptors associated with signal 
transduction, transcriptional regulators, and mutated or damaged proteins (Ciechanover, A. (1994) 
Cell 79:13-21). A murine proto-oncogene, Unp, encodes a nuclear ubiquitin protease whose 
overexpression leads to oncogenic transformation of NIH3T3 cells, and the human homolog of this 

15 gene is consistently elevated in small cell tumors and adenocarcinomas of the lung (Gray, D.A. 
(1995) Oncogene 10:2179-2183). 
Signal Peptidases 

The mechanism for the translocation process into the endoplasmic reticulum (ER) involves 
the recognition of an N-terminal signal peptide on the elongating protein. The signal peptide directs 
20 the protein and attached ribosome to a receptor on the ER membrane. The polypeptide chain passes 
through a pore in the ER membrane into the lumen while the N-terminal signal peptide remains 
attached at the membrane surface. The process is completed when signal peptidase located inside the 
ER cleaves the signal peptide from the protein and releases the protein into the lumen. 
Protease Inhibitors 

2 5 Protease inhibitors and other regulators of protease activity control the activity and effects of 

proteases. Protease inhibitors have been shown to control pathogenesis in animal models of 
proteolytic disorders (Murphy, G. (1991) Agents Actions Suppl. 35:69-76). Low levels of the 
cystatins, low molecular weight inhibitors of the cysteine proteases, correlate with malignant 
progression of tumors. (Calkins, C. et al (1995) Biol. Biochem. Hoppe Seyler 376:71-80). Serpins 
30 are inhibitors of mammalian plasma serine proteases. Many serpins serve to regulate the blood 
clotting cascade and/or the complement cascade in mammals. Sp32 is a positive regulator of the 
mammalian acrosomal protease, acrosin, that binds the proenzyme, proacrosin, and thereby aides in 
packaging the enzyme into the acrosomal matrix (Baba, T. et al. (1994) J. Biol. Chem. 269:10133- 
10140). The Kunitz family of serine protease inhibitors are characterized by one or more "Kunitz 

3 5 domains" containing a series of cysteine residues that are regularly spaced over approximately 50 
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amino acid residues and form three intrachain disulfide bonds. Members of this family include 
aprotinin, tissue factor pathway inhibitor (TFPI-1 and TFPI-2), inter-a-trypsin inhibitor, and bikunin. 
(Marlor, C.W. et al. (1997) J. Biol. Chem. 272:12202-12208.) Members of this family are potent 

* 

inhibitors (in the nanomolar range) against serine proteases such as kallikrein and plasmin. Aprotinin 
5 has clinical utility in reduction of perioperative blood loss. 

A major portion of all proteins synthesized in eukaryotic cells are synthesized on the 
cytosolic surface of the endoplasmic reticulum (ER). Before these immature proteins are distributed 
to other organelles in the cell or are secreted, they must be transported into the interior lumen of the 
ER where post-translational modifications are performed. These modifications include protein folding 
10 and the formation of disulfide bonds, and N-linked glycosylations. 
Protein Isomerases 

Protein folding in the ER is aided by two principal types of protein isomerases, protein 
disulfide isomerase (PDI), and peptidyl-prolyl isomerase (PPI). PDI catalyzes the oxidation of free 
sulfhydryl groups in cysteine residues to form intramolecular disulfide bonds in proteins. PPI, an 

1 5 enzyme that catalyzes the isomerization of certain proline imidic bonds in oligopeptides and proteins, 
is considered to govern one of the rate limiting steps in the folding of many proteins to their final 
functional conformation. The cyclophilins represent a major class of PPI that was originally 
identified as the major receptor for the immunosuppressive drug cyclosporin A (Handschumacher, 
R.E. et al. (1984) Science 226: 544-547). 

20 Protein Glvcosvlation 

The glycosylation of most soluble secreted and membrane-bound proteins by 
oligosaccharides linked to asparagine residues in proteins is also performed in the ER. This reaction 
is catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. Although the exact purpose 
of this "N-linked" glycosylation is unknown, the presence of oligosaccharides tends to make a 

25 glycoprotein resistant to protease digestion. In addition, oligosaccharides attached to cell-surface 
proteins called selectins are known to function in cell-cell adhesion processes (Alberts, B. et al. 
(1994) Molecular Biology of the Cell , Garland Publishing Co., New York NY, p.608). "O-liriked" 
glycosylation of proteins also occurs in the ER by the addition of N-acetylgalactosamine to the 
hydroxyl group of a serine or threonine residue followed by the sequential addition of other sugar 

30 residues to the first. This process is catalysed by a series of glycosyltransferases each specific for a 
particular donor sugar nucleotide and acceptor molecule (Lodish, H. et al. (1995) Molecular Cell 
Biology , W.H. Freeman and Co., New York NY, pp.700-708). In many cases, both N- and O-linked 
oligosaccharides appear to be required for the secretion of proteins or the movement of plasma 
membrane glycoproteins to the cell surface. 
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An additional glycosylation mechanism operates in the ER specifically to target lysosomal 
enzymes to lysosomes and prevent their secretion. Lysosomal enzymes in the ER receive an N-linked 
oligosaccharide, like plasma membrane and secreted proteins, but are then phosphorylated on one or 
two mannose residues. Hie phosphorylation of mannose residues occurs in two steps, the first step 
5 being the addition of an N-acetylglucosamine phosphate residue by N-acetylglucosamine 
phosphotransferase, and the second the removal of the N-acetylglucosamine group by 
phosphodiesterase. The phosphorylated mannose residue then targets the lysosomal enzyme to a 
mannose 6-phosphate receptor which transports it to a lysosome vesicle (Lodish, supra , pp. 708-71 1). 
Chaperones 

10 Molecular chaperones are proteins that aid in the proper folding of immature proteins and 

refolding of improperly folded ones, the assembly of protein subunits, and in the transport of 
unfolded proteins across membranes. Chaperones are also called heat-shock proteins (hsp) because of 
their tendency to be expressed in dramatically increased amounts following brief exposure of cells to 
elevated temperatures. This latter property most likely reflects their need in the refolding of proteins 

1 5 that have become denatured by the high temperatures. Chaperones may be divided into several 

classes according to their location, function, and molecular weight, and include hsp60, TCP1, hsp70, 
hsp40 (also called DnaJ), and hsp90. For example, hsp90 binds to steroid hormone receptors, 
represses transcription in the absence of the ligand, and provides proper folding of the ligand-binding 
domain of the receptor in the presence of the hormone (Burston, S.G. and A.R. Clarke (1995) Essays 

20 Biochem. 29:125-136). Hsp60 and hsp70 chaperones aid in the transport and folding of newly 

synthesized proteins. Hsp70 acts early in protein folding, binding a newly synthesized protein before 
it leaves the ribosome and transporting the protein to the mitochondria or ER before releasing the 
folded protein. Hsp60, along with hsplO, binds misfolded proteins and gives them the opportunity to 
refold correctly. All chaperones share an affinity for hydrophobic patches on incompletely folded 

25 proteins and the ability to hydrolyze ATP. The energy of ATP hydrolysis is used to release the hsp- 
bound protein in its properly folded state (Alberts, supra , pp 214, 571-572). 

Nucleic Acid Synthesis and Modification Molecules 

SEQ ID NO:35 and SEQ ID NO:36 encode, for example, nucleic acid synthesis and 
3 o modification molecules. 
Polymerases 

DNA and RNA replication are critical processes for cell replication and function. DNA and 
RN A replication are mediated by the enzymes DNA and RNA polymerase, respectively, by a 
"templating" process in which the nucleotide sequence of a DNA or RNA strand is copied by 
3 5 complementary base-pairing into a complementary nucleic acid sequence of either DNA or RNA. 
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However, there are fundamental differences between the two processes. 

DNA polymerase catalyzes the stepwise addition of a deoxyribonucleotide to the 3'-OH end 
of a polynucleotide strand (the primer strand) that is paired to a second (template) strand. The new 
DNA strand therefore grows in the 5' to 3' direction (Alberts, B. et al. (1994) 7116 Molecular Biology 
5 of theCelL Garland Publishing Inc., New York NY, pp. 251-254). The substrates for the 

polymerization reaction are the corresponding deoxynucleotide triphosphates which must base-pair 
with the correct nucleotide on the template strand in order to be recognized by the polymerase. 
Because DNA exists as a double-stranded helix, each of the two strands may serve as a template for 
the formation of a new complementary strand. Each of the two daughter cells of the dividing cell 
10 therefore inherits a new DNA double helix containing one old and one new strand. Thus, DNA is 
said to be replicated "semiconservatively" by DNA polymerase. In addition to the synthesis of new 
DNA, DNA polymerase is also involved in the repair of damaged DNA as discussed below under 
"Ligases." 

In contrast to DNA polymerase, RNA polymerase uses a DNA template strand to "transcribe" 
15 DNA into RNA using ribonucleotide triphosphates as substrates. Like DNA polymerization, RNA 
polymerization proceeds in a 5' to 3' direction by addition of a ribonucleoside monophosphate to the 
3' -OH end of a growing RNA chain. DNA transcription generates messenger RNAs (mRNA) that 
carry information for protein synthesis, as well as the transfer, ribosomal, and other RNAs that have 
structural or catalytic functions. In eukaryotes, three discrete RNA polymerases synthesize the three 
2 o different types of RNA (Alberts, supra , pp. 367-368). RNA polymerase I makes the large ribosomal 
RNAs, RNA polymerase n makes the mRNAs that will be translated into proteins, and RNA 
polymerase III makes a variety of small, stable RNAs, including 5S ribosomal RNA and the transfer 
RNAs (tRNA). In all cases, RNA synthesis is initiated by binding of the RNA polymerase to a 
promoter region on the DNA and synthesis begins at a start site within the promoter. Synthesis is 

2 5 completed at a broad, general stop or termination region in the DNA where both the polymerase and 

the completed RNA chain are released. 
Ligases 

DNA repair is the process by which accidental base changes, such as those produced by 
oxidative damage, hydrolytic attack, or uncontrolled methylation of DNA are corrected before 

3 o replication or transcription of the DNA can occur. Because of the efficiency of the DNA repair 

process, fewer than one in one thousand accidental base changes causes a mutation (Alberts, supra , 
pp. 245-249). The three steps common to most types of DNA repair are (1) excision of the damaged 
or altered base or nucleotide by DNA nucleases, leaving a gap; (2) insertion of the correct nucleotide 
in this gap by DNA polymerase using the complementary strand as the template; and (3) sealing the 
3 5 break left between the inserted nucleotide(s) and the existing DNA strand by DNA ligase. In the last 
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reaction, DNA ligase uses the energy from ATP hydrolysis to activate the 5* end of the broken 
phosphodiester bond before forming the new bond with the 3*-OH of the DNA strand. In Bloom's 
syndrome, an inherited human disease, individuals are partially deficient in DNA ligation and 
consequently have an increased incidence of cancer (Alberts, supra , p. 247). 
5 Nucleases 

Nucleases comprise both enzymes that hydrolyze DNA (DNase) and RNA (RNase). They 
serve different purposes in nucleic acid metabolism. Nucleases hydrolyze the phosphodiester bonds 
between adjacent nucleotides either at internal positions (endonucleases) or at the terminal 3' or 5* 
nucleotide positions (exonucleases). A DNA exonuclease activity in DNA polymerase, for example, 

10 serves to remove improperly paired nucleotides attached to the 3' -OH end of the growing DNA strand 
by the polymerase and thereby serves a "proofreading" function. As mentioned above, DNA 
endonuclease activity is involved in the excision step of the DNA repair process. 

RNases also serve a variety of functions. For example, RNase P is a ribonucleoprotein 
enzyme which cleaves the 5* end of pre-tRNAs as part of their maturation process. RNase H digests 

15 the RNA strand of an RNA/DNA hybrid. Such hybrids occur in cells invaded by retroviruses, and 
RNase H is an important enzyme in the retroviral replication cycle. Pancreatic RNase secreted by the 
pancreas into the intestine hydrolyzes RNA present in ingested foods. RNase activity in serum and 
cell extracts is elevated in a variety of cancers and infectious diseases (Schein, C.H. (1997) Nat. 
Biotechnol. 15:529-536). Regulation of RNase activity is being investigated as a means to control 

20 tumor angiogenesis, allergic reactions, viral infection and replication, and fungal infections. 
Methyl ases 

Methylation of specific nucleotides occurs in both DNA and RNA, and serves different 
functions in the two macromolecules. Methylation of cytosine residues to form 5-methyl cytosine in 
DNA occurs specifically at CG sequences which are base-paired with one another in the DNA double- 

2 5 helix. This pattern of methylation is passed from generation to generation during DNA replication by 

an enzyme called "maintenance methylase" that acts preferentially on those CG sequences that are 
base-paired with a CG sequence that is already methylated. Such methylation appears to distinguish 
active from inactive genes by preventing the binding of regulatory proteins that "turn on" the gene, 
but permit the binding of proteins that inactivate the gene (Alberts, supra , pp. 448-45 1). In RNA 

3 o metabolism, "tRNA methylase" produces one of several nucleotide modifications in tRNA that affect 

the conformation and base-pairing of the molecule and facilitate the recognition of the appropriate 
mRNA codons by specific tRNAs. The primary methylation pattern is the dimethylation of guanine 
residues to form N,N-dimethyl guanine. 
Helicases and Single-Stranded Binding Proteins 
3 5 Helicases are enzymes that destabilize and unwind double helix structures in both DNA and 
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RNA. Since DN A replication occurs more or less simultaneously on both strands, the two strands 
must first separate to generate a replication "fork" for DNA polymerase to act on. Two types of 
replication proteins contribute to this process, DNA helicases and single-stranded binding proteins. 
DNA helicases hydrolyze ATP and use the energy of hydrolysis to separate the DNA strands. Single- 
5 stranded binding proteins (SSBs) then bind to the exposed DNA strands without covering the bases, 
thereby temporarily stabilizing them for templating by the DNA polymerase (Alberts, supra , pp. 255- 
256). 

RN A helicases also alter and regulate RNA conformation and secondary structure. Like the 
DNA helicases, RNA helicases utilize energy derived from ATP hydrolysis to destabilize and unwind 

10 RNA duplexes. The most well-characterized and ubiquitous family of RNA helicases is the DEAD- 
box family, so named for the conserved B-type ATP-binding motif which is diagnostic of proteins in 
this family. Over 40 DEAD-box helicases have been identified in organisms as diverse as bacteria, 
insects, yeast, amphibians, mammals, and plants. DEAD-box helicases function in diverse processes 
such as translation initiation, splicing, ribosome assembly, and RNA editing, transport, and stability. 

15 Some DEAD-box helicases play tissue- and stage-specific roles in spermatogenesis and 

embryogenesis. Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the 
progression of neuroblastoma (Nb) and retinoblastoma (Rb) tumors (Godbout, R. et al. (1998) J. Biol. 
Chem. 273:21 161-21 168). These observations suggest that DDX1 may promote or enhance tumor 
progression by altering the normal secondary structure and expression levels of RNA in cancer cells. 

2 o Other DEAD-box helicases have been implicated either directly or indirectly in tumorigenesis 

(Discussed in Godbout, supra) . For example, murine p68 is mutated in ultraviolet light-induced 
tumors, and human DDX6 is located at a chromosomal breakpoint associated with B-cell lymphoma. 
Similarly, a chimeric protein comprised of DDX10 and NUP98, a nucleoporin protein, may be 
involved in the pathogenesis of certain myeloid malignancies. 
25 Topoisomerases 

Besides the need to separate DNA strands prior to replication, the two strands must be 
"unwound" from one another prior to their separation by DNA helicases. This function is performed 
by proteins known as DNA topoisomerases. DNA topoisomerase effectively acts as a reversible 
nuclease that hydrolyzes a phosphodiesterase bond in a DNA strand, permitting the two strands to 

3 o rotate freely about one another to remove the strain of the helix, and then rejoins the original 

phosphodiester bond between the two strands. Two types of DNA topoisomerase exist, types I and II. 
DNA Topoisomerase I causes a single-strand break in a DNA helix to allow the rotation of the two 
strands of the helix about the remaining phosphodiester bond in the opposite strand. DNA 
topoisomerase II causes a transient break in both strands of a DNA helix where two double helices 
3 5 cross over one another. This type of topoisomerase can efficiently separate two interlocked DNA 
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circles (Alberts, supra , pp.260-262). Type II topoisomerases are largely confined to proliferating cells 
in eukaryotes, such as cancer cells. For this reason they are targets for anticancer drugs. 
Topoisomerase II has been implicated in multi-drug resistance (MDR) as it appears to aid in the repair 
of DNA damage inflicted by DN A binding agents such as doxorubicin and vincristine. 
5 Recombinases 

Genetic recombination is the process of rearranging DNA sequences within an organism's 
genome to provide genetic variation for the organism in response to changes in the environment. 
DNA recombination allows variation in the particular combination of genes present in an individual's 
genome, as well as the timing and level of expression of these genes (see Alberts, supra , pp. 263- 

10 273). Two broad classes of genetic recombination are commonly recognized, general recombination 
and site-specific recombination. General recombination involves genetic exchange between any 
homologous pair of DNA sequences usually located on two copies of the same chromosome. The 
process is aided by enzymes called recombinases that "nick" one strand of a DNA duplex more or less 
randomly and permit exchange with the complementary strand of another duplex. The process does 

is not normally change the arrangement of genes on a chromosome. In site-specific recombination, the 
recombinase recognizes specific nucleotide sequences present in one or both of the recombining 
molecules. Base-pairing is not involved in this form of recombination and therefore does not require 
DNA homology between the recombining molecules. Unlike general recombination, this form of 
recombination can alter the relative positions of nucleotide sequences in chromosomes. 

20 Splicing Factors 

Various proteins are necessary for processing of transcribed RNAs in the nucleus. Pre- 
mRNA processing steps include capping at the 5 1 end with methylguanosine, polyadenylating the 3* 
end, and splicing to remove introns. The primary RNA transcript from DNA is a faithful copy of the 
gene containing both exon and intron sequences, and the latter sequences must be cut out of the RNA 

2 5 transcript to produce an mRN A that codes for a protein. This "splicing" of the mRNA sequence takes 

place in the nucleus with the aid of a large, multicomponent ribonucleoprotein complex known as a 
spliceosome. The spliceosomal complex is composed of five small nuclear ribonucleoprotein 
particles (snRNPs) designated Ul, U2, U4, U5, and U6, and a number of additional proteins. Each 
snRNP contains a single species of snRNA and about ten proteins. The RNA components of some 
30 snRNPs recognize and base pair with intron consensus sequences. The protein components mediate 
spliceosome assembly and the splicing reaction. Autoantibodies to snRNP proteins are found in the 
blood of patients with systemic lupus erythematosus (Stryer, L. (1995) Biochemistry , W.H. Freeman 
and Company, New York NY, p. 863). 

3 5 Adhesion Molecules 



79 



WO 01/21836 PCT/US00/25643 

The surface of a cell is rich in transmembrane proteoglycans, glycoproteins, glycolipids, and 
receptors. These macromolecules mediate adhesion with other cells and with components of the 
extracellular matrix (ECM). The interaction of the cell with its surroundings profoundly influences 
cell shape, strength, flexibility, motility, and adhesion. These dynamic properties are intimately 
5 associated with signal transduction pathways controlling cell proliferation and differentiation, tissue 
construction, and embryonic development. 
Cadherins 

Cadherins comprise a family of calcium-dependent glycoproteins that function in mediating 
cell-cell adhesion in virtually all solid tissues of multicellular organisms. These proteins share 

10 multiple repeats of a cadherin-specific motif, and the repeats form the folding units of the cadherin 
extracellular domain. Cadherin molecules cooperate to form focal contacts, or adhesion plaques, 
between adjacent epithelial cells. The cadherin family includes the classical cadherins and 
protocadherins. Classical cadherins include the E-cadherin, N-cadherin, and P-cadherin subfamilies. 
E-cadherin is present on many types of epithelial cells and is especially important for embryonic 

15 development N-cadherin is present on nerve, muscle, and lens cells and is also critical for embryonic 
development P-cadherin is present on cells of the placenta and epidermis. Recent studies report that 
protocadherins are involved in a variety of cell-cell interactions (Suzuki, S.T. (1996) J. Cell Sci. 
109:2609-261 1). The intracellular anchorage of cadherins is regulated by their dynamic association 
with catenins, a family of cytoplasmic signal transduction proteins associated with the actin 

2 o cytoskeleton. The anchorage of cadherins to the actin cytoskeleton appears to be regulated by protein 
tyrosine phosphorylation, and the cadherins are the target of phosphorylation-induced junctional 
disassembly (Aberle, H. et al. (1996) J. Cell. Biochem. 61:514-523). 
Integrins 

Integrins are ubiquitous transmembrane adhesion molecules that link the ECM to the internal 

2 5 cytoskeleton. Integrins are composed of two noncovalently associated transmembrane glycoprotein 

subunits called a and (J. Integrins function as receptors that play a role in signal transduction. For 
example, binding of integrin to its extracellular ligand may stimulate changes in intracellular calcium 
levels or protein kinase activity (Sjaastad, M.D. and W.J. Nelson (1997) BioEssays 19:47-55). At 
least ten cell surface receptors of the integrin family recognize the ECM component fibronectin, 

3 o which is involved in many different biological processes including cell migration and embryogenesis 

(Johansson, S. etal. (1997) Front. Biosci. 2:D126-D146). 
Lectins 

Lectins comprise a ubiquitous family of extracellular glycoproteins which bind cell surface 
carbohydrates specifically and reversibly, resulting in the agglutination of cells (reviewed in 
35 Drickamer, K. and M.E. Taylor (1993) Annu. Rev. Cell Biol. 9:237-264). This function is 
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particularly important for activation of the immune response. Lectins mediate the agglutination and 
mitogenic stimulation of lymphocytes at sites of inflammation (Lasky, L.A. (1991) J. Cell. Biochem. 
45:139-146; Paietta, E. et al. (1989) J. Immunol. 143:2850-2857). 

Lectins are further classified into subfamilies based on carbohydrate-binding specificity and 
5 other criteria. The galectin subfamily, in particular, includes lectins that bind p-galactoside 

carbohydrate moieties in a thiol-dependent manner (reviewed in Hadari, Y.R. et al. (1998) J. Biol. 
Chem. 270:3447-3453). Galectins are widely expressed and developmentally regulated. Because all 
galectins lack an N-terminal signal peptide, it is suggested that galectins are externalized through an 
atypical secretory mechanism. Two classes of galectins have been defined based on molecular weight 
10 and oligomerization properties. Small galectins form homodimers and are about 14 to 16 kilodaltons 
in mass, while large galectins are monomeric and about 29-37 kilodaltons. 

Galectins contain a characteristic carbohydrate recognition domain (CRD). The CRD is 
about 140 amino acids and contains several stretches of about 1-10 amino acids which are highly 
conserved among all galectins. A particular 6-amino acid motif within the CRD contains conserved 
15 tryptophan and arginine residues which are critical for carbohydrate binding. The CRD of some 
galectins also contains cysteine residues which may be important for disulfide bond formation. 
Secondary structure predictions indicate that the CRD forms several P-sheets. 

Galectins play a number of roles in diseases and conditions associated with cell-cell and cell- 
matrix interactions. For example, certain galectins associate with sites of inflammation and bind to 

2 o cell surface immunoglobulin E molecules. In addition, galectins may play an important role in cancer 

metastasis. Galectin overexpression is correlated with the metastatic potential of cancers in humans 
and mice. Moreover, anti-gal ectin antibodies inhibit processes associated with cell transformation, 
such as cell aggregation and anchorage-independent growth (See, for example, Su, Z.-Z. et al. (1996) 
Proc. Natl. Acad. Sci. USA 93:7252-7257). 
25 Selectins 

Selectins, or LEC-C AMs, comprise a specialized lectin subfamily involved primarily in 
inflammation and leukocyte adhesion (Reviewed in Lasky, supra) . Selectins mediate the recruitment 
of leukocytes from the circulation to sites of acute inflammation and are expressed on the surface of 
vascular endothelial cells in response to cytokine signaling. Selectins bind to specific ligands on the 

3 o leukocyte cell membrane and enable the leukocyte to adhere to and migrate along the endothelial 

surface. Binding of selectin to its ligand leads to polarized rearrangement of the actin cytoskeleton 
and stimulates signal transduction within the leukocyte (Brenner, B. et al. (1997) Biochem. Biophys. 
Res. Commun. 231:802-807; Hidari, K.I. et al. (1997) J. Biol. Chem. 272:28750-28756). Members 
of the selectin family possess three characteristic motifs: a lectin or carbohydrate recognition domain; 
35 an epidermal growth factor-like domain; and a variable number of short consensus repeats (scr or 
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"sushi" repeats) which are also present in complement regulatory proteins. The selectins include 
lymphocyte adhesion molecule- 1 (Lam-1 or L-selectin), endothelial leukocyte adhesion molecule- 1 
(ELAM-1 or E-selectin), and granule membrane protein- 140 (GMP-140 or P-selectin) (Johnston, G.I. 
et al. (1989) Cell 56:1033-1044). 

5 

Antigen Recognition Molecules 

SEQ ID NO:37 encodes, for example, an antigen recognition molecule. 
All vertebrates have developed sophisticated and complex immune systems that provide 
protection from viral, bacterial, fungal, and parasitic infections. A key feature of the immune system 

io is its ability to distinguish foreign molecules, or antigens, from "self 1 molecules. This ability is 
mediated primarily by secreted and transmembrane proteins expressed by leukocytes (white blood 
cells) such as lymphocytes, granulocytes, and monocytes. Most of these proteins belong to the 
immunoglobulin (Ig) superfamily, members of which contain one or more repeats of a conserved 
structural domain. This Ig domain is comprised of antiparallel P sheets joined by a disulfide bond in 

is an arrangement called the Ig fold. Members of the Ig superfamily include T-cell receptors, major 
histocompatibility (MHC) proteins, antibodies, and immune cell-specific surface markers such as 
CD4, CD8, andCD28. 

MHC proteins are cell surface markers that bind to and present foreign antigens to T cells. 
MHC molecules are classified as either class I or class II. Class I MHC molecules (MHC I) are 

2 o expressed on the surface of almost all cells and are involved in the presentation of antigen to 

cytotoxic T cells. For example, a cell infected with virus will degrade intracellular viral proteins and 
express the protein fragments bound to MHC I molecules on the cell surface. The MHC I/antigen 
complex is recognized by cytotoxic T-cells which destroy the infected cell and the virus within. 
Class II MHC molecules are expressed primarily on specialized antigen-presenting cells of the 

2 5 immune system, such as B-cells and macrophages. These cells ingest foreign proteins from the 

extracellular fluid and express MHC H/antigen complex on the cell surface. This complex activates 
helper T-cells, which then secrete cytokines and other factors that stimulate the immune response. 
MHC molecules also play an important role in organ rejection following transplantation. Rejection 
occurs when the recipient's T-cells respond to foreign MHC molecules on the transplanted organ in 

30 the same way as to self MHC molecules bound to foreign antigen. (Reviewed in Alberts, B. et al. 
(1994) Molecular Biology of the Cell , Garland Publishing, New York NY, pp. 1229-1246.) 

Antibodies, or immunoglobulins, are either expressed on the surface of B-cells or secreted by 
B-cells into the circulation. Antibodies bind and neutralize foreign antigens in the blood and other 
extracellular fluids. The prototypical antibody is a tetramer consisting of two identical heavy 

35 polypeptide chains (H-chains) and two identical light polypeptide chains (L-chains) interlinked by 
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disulfide bonds. This arrangement confers the characteristic Y-shape to antibody molecules. 
Antibodies are classified based on their H-chain composition. The five antibody classes, IgA, IgD, 
IgE, IgG and IgM, are defined by the a, 8, e, y, and \x H-chain types. There are two types of L- 
chains, k and X, either of which may associate as a pair with any H-chain pair. IgG, the most 
5 common class of antibody found in the circulation, is tetrameric, while the other classes of antibodies 
are generally variants or multimers of this basic structure. 

H-chains and L-chains each contain an N-terminal variable region and a C-terminal constant 
region. The constant region consists of about 110 amino acids in L-chains and about 330 or 440 
amino acids in H-chains. The amino acid sequence of the constant region is nearly identical among 

10 H- or L-chains of a particular class. The variable region consists of about 110 amino acids in both H- 
and L-chains. However, the amino acid sequence of the variable region differs among H- or L-chains 
of a particular class. Within each H- or L-chain variable region are three hypervariable regions of 
extensive sequence diversity, each consisting of about 5 to 10 amino acids. In the antibody molecule, 
the H- and L-chain hypervariable regions come together to form the antigen recognition site. 

is (Reviewed in Alberts, supra , pp. 1206-1213 and 1216-1217.) 

Both H-chains and L-chains contain repeated Ig domains. For example, a typical H-chain 
contains four Ig domains, three of which occur within the constant region and one of which occurs 
within the variable region and contributes to the formation of the antigen recognition site. Likewise, 
a typical L-chain contains two Ig domains, one of which occurs within the constant region and one of 

2 o which occurs within the variable region. 

The immune system is capable of recognizing and responding to any foreign molecule that 
enters the body. Therefore, the immune system must be armed with a full repertoire of antibodies 
against all potential antigens. Such antibody diversity is generated by somatic rearrangement of gene 
segments encoding variable and constant regions. These gene segments are joined together by site- 

2 5 specific recombination which occurs between highly conserved DNA sequences that flank each gene 

segment. Because there are hundreds of different gene segments, millions of unique genes can be 
generated combinatorially . In addition, imprecise joining of these segments and an unusually high 
rate of somatic mutation within these segments further contribute to the generation of a diverse 
antibody population. 

3 o T-cell receptors are both structurally and functionally related to antibodies. (Reviewed in 

Alberts, supra , pp. 1228-1229.) T-cell receptors are cell surface proteins that bind foreign antigens and 
mediate diverse aspects of the immune response. A typical T-cell receptor is a heterodimer comprised 
of two disulfide-linked polypeptide chains called a and p. Each chain is about 280 amino acids in 
length and contains one variable region and one constant region. Each variable or constant region folds 
3 5 into an Ig domain. The variable regions from the a and P chains come together in the heterodimer to 
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form the antigen recognition site. T-cell receptor diversity is generated by somatic rearrangement of 
gene segments encoding the a and p chains. T-cell receptors recognize small peptide antigens that are 
expressed on the surface of antigen-presenting cells and pathogen-infected cells. These peptide antigens 
are presented on the cell surface in association with major histocompatibility proteins which provide the 
5 proper context for antigen recognition. 

Secreted and Extracellular Matrix Molecules 

SEQ ID NO:38 and SEQ ID NO:39 encode, for example, secreted/extracellular matrix 
molecules. 

10 Protein secretion is essential for cellular function. Protein secretion is mediated by a signal 

peptide located at the amino terminus of the protein to be secreted. The signal peptide is comprised of 
about ten to twenty hydrophobic amino acids which target the nascent protein from the ribosome to the 
endoplasmic reticulum (ER). Proteins targeted to the ER may either proceed through the secretory 
pathway or remain in any of the secretory organelles such as the ER, Golgi apparatus, or lysosomes. 

1 5 Proteins that transit through the secretory pathway are either secreted into the extracellular space or 
retained in the plasma membrane. Secreted proteins are often synthesized as inactive precursors that 
are activated by post-translational processing events during transit through the secretory pathway. 
Such events include glycosylation, proteolysis, and removal of the signal peptide by a signal peptidase. 
Other events that may occur during protein transport include chaperone-dependent unfolding and 

2 o folding of the nascent protein and interaction of the protein with a receptor or pore complex. Examples 

of secreted proteins with amino terminal signal peptides include receptors, extracellular matrix 
molecules, cytokines, hormones, growth and differentiation factors, neuropeptides, vasomediators, ion 
channels, transporters/pumps, and proteases. (Reviewed in Alberts, B. et al. (1994) Molecular Biology 
of The Cell , Garland Publishing, New York NY, pp. 557-560, 582-592.) 
25 The extracellular matrix (ECM) is a complex network of glycoproteins, polysaccharides, 

proteoglycans, and other macromolecules that are secreted from the cell into the extracellular space. 
The ECM remains in close association with the cell surface and provides a supportive meshwork that 
profoundly influences cell shape, motility, strength, flexibility, and adhesion. In fact, adhesion of a 
cell to its surrounding matrix is required for cell survival except in the case of metastatic tumor cells, 

3 o which have overcome the need for cell-ECM anchorage. This phenomenon suggests that the ECM 

plays a critical role in the molecular mechanisms of growth control and metastasis. (Reviewed in 
Ruoslahti, E. (1996) Sci. Am. 275:72-77.) Furthermore, the ECM determines the structure and 
physical properties of connective tissue and is particularly important for morphogenesis and other 
processes associated with embryonic development and pattern formation. 
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The collagens comprise a family of ECM proteins that provide structure to bone, teeth, skin, 
ligaments, tendons, cartilage, blood vessels, and basement membranes. Multiple collagen proteins have 
been identified. Three collagen molecules fold together in a triple helix stabilized by interchain disulfide 
bonds. Bundles of these triple helices then associate to form fibrils. Collagen primary structure 
5 consists of hundreds of (Gly-X- Y) repeats where about a third of the X and Y residues are Pro. 

Glycines are crucial to helix formation as the bulkier amino acid sidechains cannot fold into the triple 
helical conformation. Because of these strict sequence requirements, mutations in collagen genes have 
severe consequences. Osteogenesis imperfecta patients have brittle bones that fracture easily; in severe 
cases patients die in utero or at birth. Ehlers-Darilos syndrome patients have hyperelastic skin, 

10 hypermobile joints, and susceptibility to aortic and intestinal rupture. Chondrodysplasia patients have 
short stature and ocular disorders. Alport syndrome patients have hematuria, sensorineural deafness, 
and eye lens deformation. (Isselbacher, K.J. et al. (1994) Harrison's Principles of Internal Medicine . 
McGraw-Hill, Inc., New York NY, pp. 2105-21 17; and Creighton, T.E. (1984) Proteins, Structures 
and Molecular Principles . W.H. Freeman and Company, New York NY, pp. 191-197.) 

15 Elastin and related proteins confer elasticity to tissues such as skin, blood vessels, and lungs. 

Elastin is a highly hydrophobic protein of about 750 amino acids that is rich in proline and glycine 
residues. Elastin molecules are highly cross-linked, forming an extensive extracellular network of fibers 
and sheets. Elastin fibers are surrounded by a sheath of microfibrils which are composed of a number 
of glycoproteins, including fibrillin. Mutations in the gene encoding fibrillin are responsible for 

20 Marfan' s syndrome, a genetic disorder characterized by defects in connective tissue. In severe cases, 
the aortas of afflicted individuals are prone to rupture. (Reviewed in Alberts, supra , pp. 984-986.) 

Fibronectin is a large ECM glycoprotein found in all vertebrates. Fibronectin exists as a dimer 
of two subunits, each containing about 2,500 amino acids. Each subunit folds into a rod-like structure 
containing multiple domains. The domains each contain multiple repeated modules, the most common 

25 of which is the type III fibronectin repeat. The type m fibronectin repeat is about 90 amino acids in 
length and is also found in other ECM proteins and in some plasma membrane and cytoplasmic 
proteins. Furthermore, some type III fibronectin repeats contain a characteristic tripeptide consisting of 
Arginine-Glycine-Aspartic acid (RGD). The RGD sequence is recognized by the integrin family of cell 
surface receptors and is also found in other ECM proteins. Disruption of both copies of the gene 

3 o encoding fibronectin causes early embryonic lethality in mice. The mutant embryos display extensive 
morphological defects, including defects in the formation of the notochord, somites, heart, blood 
vessels, neural tube, and extraembryonic structures. (Reviewed in Alberts, supra , pp. 986-987.) 

Laminin is a major glycoprotein component of the basal lamina which underlies and supports 
epithelial cell sheets. Laminin is one of the first ECM proteins synthesized in the developing embryo. 
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Laminin is an 850 kilodalton protein composed of three polypeptide chains joined in the shape of a 
cross by disulfide bonds. Laminin is especially important for angiogenesis and in particular, for 
guiding the formation of capillaries. (Reviewed in Alberts, supra , pp. 990-991.) 

There are many other types of proteinaceous ECM components, most of which can be 
5 classified as proteoglycans. Proteoglycans are composed of unbranched polysaccharide chains 

(glycosaminoglycans) attached to protein cores. Common proteoglycans include aggrecan, betaglycan, 
decorin, perlecan, serglycin, and syndecan-1. Some of these molecules not only provide mechanical 
support, but also bind to extracellular signaling molecules, such as fibroblast growth factor and 
transforming growth factor p, suggesting a role for proteoglycans in cell-cell communication and cell 
10 growth. (Reviewed in Alberts, supra , pp. 973-978.) Likewise, the glycoproteins tenascin-C and 
tenascin-R are expressed in developing and lesioned neural tissue and provide stimulatory and anti- 
adhesive (inhibitory) properties, respectively, for axonal growth. (Faissner, A. (1997) Cell Tissue Res. 
290:331-341.) 

15 Cytoskeletal Molecules 

SEQ ID NO.40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, and SEQ 
ID NO:45 encode, for example, cytoskeletal molecules. 

The cytoskeleton is a cytoplasmic network of protein fibers that mediate cell shape, structure, 
and movement. The cytoskeleton supports the cell membrane and forms tracks along which 

2 o organelles and other elements move in the cytosol. The cytoskeleton is a dynamic structure that 

allows cells to adopt various shapes and to carry out directed movements. Major cytoskeletal fibers 
include the microtubules, the microfilaments, and the intermediate filaments. Motor proteins, 
including myosin, dynein, and kinesin, drive movement of or along the fibers. The motor protein 
dynamin drives the formation of membrane vesicles. Accessory or associated proteins modify the 
25 structure or activity of the fibers while cytoskeletal membrane anchors connect the fibers to the cell 
membrane. 
Tubulins 

Microtubules, cytoskeletal fibers with a diameter of about 24 nm, have multiple roles in the 
cell. Bundles of microtubules form cilia and flagella, which are whip-like extensions of the cell 

3 o membrane that are necessary for sweeping materials across an epithelium and for swimming of 

sperm, respectively. Marginal bands of microtubules in red blood cells and platelets are important for 
these cells* pliability. Organelles, membrane vesicles, and proteins are transported in the cell along 
tracks of microtubules. For example, microtubules run through nerve cell axons, allowing bi- 
directional transport of materials and membrane vesicles between the cell body and the nerve 
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terminal. Failure to supply the nerve terminal with these vesicles blocks the transmission of neural 
signals. Microtubules are also critical to chromosomal movement during cell division. Both stable 
and short-lived populations of microtubules exist in the cell. 

Microtubules are polymers of GTP-binding tubulin protein subunits. Each subunit is a 
5 heterodimer of a- and p~ tubulin, multiple isoforms of which exist. The hydrolysis of GTP is linked 
to the addition of tubulin subunits at the end of a microtubule. The subunits interact head to tail to 
form protofilaments; the protofilaments interact side to side to form a microtubule. A microtubule is 
polarized, one end ringed with a -tubulin and the other with p-tubulin, and the two ends differ in their 
rates of assembly. Generally, each microtubule is composed of 13 protofilaments although 11 or 15 

10 protofilament-microtubules are sometimes found. Cilia and flageUa contain doublet microtubules. 
Microtubules grow from specialized structures known as centrosomes or microtubule-organizing 
centers (MTOCs). MTOCs may contain one or two centrioles, which are pinwheel arrays of triplet 
microtubules. The basal body, the organizing center located at the base of a cilium or flagellum, 
contains one centriole. Gamma tubulin present in the MTOC is important for nucleating the 

is polymerization of a- and P- tubulin heterodimers but does not polymerize into microtubules. 
Microtubule- Associated Proteins 

Microtubule-associated proteins (MAPs) have roles in the assembly and stabilization of 
microtubules. One major family of MAPs, assembly MAPs, can be identified in neurons as well as 
non-neuronal cells. Assembly MAPs are responsible for cross-linking microtubules in the cytosol. 

2 o These MAPs are organized into two domains: a basic microtubule-binding domain and an acidic 

projection domain. The projection domain is the binding site for membranes, intermediate filaments, or 
other microtubules. Based on sequence analysis, assembly MAPs can be further grouped into two 
types: Type I and Type II. Type I MAPs, which include MAPI A and MAP1B, are large, filamentous 
molecules that co-purify with microtubules and are abundantly expressed in brain and testes. Type I 

2 5 MAPs contain several repeats of a positively-charged amino acid sequence motif that binds and 

neutralizes negatively charged tubulin, leading to stabilization of microtubules. MAPI A and MAP1B 
are each derived from a single precursor polypeptide that is subsequently proteolytically processed to 
generate one heavy chain and one light chain. 

Another light chain, LC3, is a 16.4 kDa molecule that binds MAPI A, MAP1B, and 

30 microtubules. It is suggested that LC3 is synthesized from a source other than the MAP1A or MAP1B 
transcripts, and that the expression of LC3 may be important in regulating the microtubule binding 
activity of MAPI A and MAP1B during cell proliferation (Mann, S.S. et al. (1994) J. Biol. Chem. 
269:11492-11497). 

Type II MAPs, which include MAP2a, MAP2b, MAP2c, MAP4, and Tau, are characterized 
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by three to four copies of an 1 8-residue sequence in the microtubule-binding domain. MAP2a, M AP2b, 
and MAP2c are found only in dendrites, MAP4 is found in non-neuronal cells, and Tau is found in 
axons and dendrites of nerve cells. Alternative splicing of the Tau mRN A leads to the existence of 
multiple forms of Tau protein. Tau phosphorylation is altered in neurodegenerative disorders such as 
5 Alzheimer* s disease, Pick's disease, progressive supranuclear palsy, corticobasal degeneration, and 
familial frontotemporal dementia and Parkinsonism linked to chromosome 17. The altered Tau 
phosphorylation leads to a collapse of the microtubule network and the formation of intraneuronal 
Tau aggregates (Spillantini, M.G. and M. Goedert (1998) Trends Neurosci. 21:428-433). 

The protein pericentrin is found in the MTOC and has a role in microtubule assembly. 

10 Actins 

Microfilaments, cytoskeletal filaments with a diameter of about 7-9 nm, are vital to cell 
locomotion, cell shape, cell adhesion, cell division, and muscle contraction. Assembly and 
disassembly of the microfilaments allow cells to change their morphology. Microfilaments are the 
polymerized form of actin, the most abundant intracellular protein in the eukaryotic cell. Human cells 

15 contain six isoforms of actin. The three a-actins are found in different kinds of muscle, nonmuscle P- 
actin and nonmuscle y-actin are found in nonmuscle cells, and another y-actin is found in intestinal 
smooth muscle cells. G-actin, the monomelic form of actin, polymerizes into polarized, helical F- 
actin filaments, accompanied by the hydrolysis of ATP to ADP. Actin filaments associate to form 
bundles and networks, providing a framework to support the plasma membrane and determine cell 

20 shape. These bundles and networks are connected to the cell membrane. In muscle cells, thin 
filaments containing actin slide past thick filaments containing the motor protein myosin during 
contraction. A family of actin-related proteins exist that are not part of the actin cytoskeleton, but 
rather associate with microtubules and dynein. 
Actin- Associated Proteins 

2 5 Actin-associated proteins have roles in cross-linking, severing, and stabilization of actin 

filaments and in sequestering actin monomers. Several of the actin-associated proteins have multiple 
functions. Bundles and networks of actin filaments are held together by actin cross-linking proteins. 
These proteins have two actin-binding sites, one for each filament. Short cross-linking proteins 
promote bundle formation while longer, more flexible cross-linking proteins promote network 
30 formation. Calmodulin-like calcium-binding domains in actin cross-linking proteins allow calcium 
regulation of cross-linking. Group I cross-linking proteins have unique actin-binding domains and 
include the 30 kD protein, EF-la, fascin, and scruin. Group II cross-linking proteins have a 7,000- 
MW actin-binding domain and include villin and dematin. Group in cross-linking proteins have 
pairs of a 26,000-MW actin-binding domain and include fimbrin, spectrin, dystrophin, ABP 120, and 

3 5 filamin. 
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Severing proteins regulate the length of actin filaments by breaking them into short pieces or 
by blocking their ends. Severing proteins include gCAP39, severin (fragmin), gelsolin, and villin. 
Capping proteins can cap the ends of actin filaments, but cannot break filaments. Capping proteins 
include CapZ and tropomodulin. The proteins thymosin and profilin sequester actin monomers in the 
5 cytosol, allowing a pool of unpolymerized actin to exist. The actin-associated proteins tropomyosin, 
troponin, and caldesmon regulate muscle contraction in response to calcium. 
Intermediate Filaments and Associated Proteins 

Intermediate filaments (IFs) are cytoskeletal fibers with a diameter of about 10 nm, 
intermediate between that of microfilaments and microtubules. IFs serve structural roles in the cell, 

10 reinforcing cells and organizing cells into tissues. IFs are particularly abundant in epidermal cells and 
in neurons. IFs are extremely stable, and, in contrast to microfilaments and microtubules, do not 
function in cell motility. 

Five types of IF proteins are known in mammals. Type I and Type II proteins are the acidic 
and basic keratins, respectively. Heterodimers of the acidic and basic keratins are the building blocks 

is of keratin IFs. Keratins are abundant in soft epithelia such as skin and cornea, hard epithelia such as 
nails and hair, and in epithelia that line internal body cavities. Mutations in keratin genes lead to 
epithelial diseases including epidermolysis bullosa simplex, bullous congenital ichthyosiform 
erythroderma (epidermolytic hyperkeratosis), non-epidermolytic and epidermolytic palmoplanar 
keratoderma, ichthyosis bullosa of Siemens, pachyonychia congenita, and white sponge nevus. Some of 

20 these diseases result in severe skin blistering. (See, e.g., Wawersik, M. et al. (1997) J. Biol. Chem. 
272:32557-32565; and CordenL.D. and W.H. McLean (1996) Exp. Dermatol. 5:297-307.) 

Type III IF proteins include desmin, glial fibrillary acidic protein, vimentin, and peripherin. 
Desmin filaments in muscle cells link myofibrils into bundles and stabilize sarcomeres in contracting 
muscle. Glial fibrillary acidic protein filaments are found in the glial cells that surround neurons and 

25 astrocytes. Vimentin filaments are found in blood vessel endothelial cells, some epithelial cells, and 
mesenchymal cells such as fibroblasts, and are commonly associated with microtubules. Vimentin 
filaments may have roles in keeping the nucleus and other organelles in place in the cell. Type IV IFs 
include the neurofilaments and nestin. Neurofilaments, composed of three polypeptides NF-L, NF-M, 
and NF-H, are frequently associated with microtubules in axons. Neurofilaments are responsible for 

30 the radial growth and diameter of an axon, and ultimately for the speed of nerve impulse transmission. 
Changes in phosphorylation and metabolism of neurofilaments are observed in neurodegenerative 
diseases including amyotrophic lateral sclerosis, Parkinson's disease, and Alzheimer's disease (Julien, 
J.P. and W.E. Mushynski (1998) Prog. Nucleic Acid Res. Mol. Biol. 61:1-23). Type V IFs, the lamins, 
are found in the nucleus where they support the nuclear membrane. 
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IFs have a central a-helical rod region interrupted by short nonhelical linker segments. The rod 
region is bracketed, in most cases, by non-helical head and tail domains. The rod regions of 
intermediate filament proteins associate to form a coiled-coil dimer. A highly ordered assembly process 
leads from the dimers to the IFs. Neither ATP nor GTP is needed for IF assembly, unlike that of 
5 microfilaments and microtubules. 

IF-associated proteins (IFAPs) mediate the interactions of IFs with one another and with other 
cell structures. IFAPs cross-link IFs into a bundle, into a network, or to the plasma membrane, and 
may cross-link IFs to the microfilament and microtubule cytoskeleton. Microtubules and IFs are in 
particular closely associated IFAPs include BPAG1, plakoglobin, desmoplakin I, desmoplakin II, 
10 plectin, ankyrin, filaggrin, and lamin B receptor. 
Cvtoskeletal-Membrane Anchors 

Cytoskeletal fibers are attached to the plasma membrane by specific proteins. These 
attachments are important for maintaining cell shape and for muscle contraction. In erythrocytes, the 
spectrin- actin cytoskeleton is attached to cell membrane by three proteins, band 4.1, ankyrin, and 
is adducin. Defects in this attachment result in abnormally shaped cells which are more rapidly 

degraded by the spleen, leading to anemia. In platelets, the spectrin-actin cytoskeleton is also linked 
to the membrane by ankyrin; a second actin network is anchored to the membrane by filamin. In 
muscle cells the protein dystrophin links actin filaments to the plasma membrane; mutations in the 
dystrophin gene lead to Duchenne muscular dystrophy. In adherens junctions and adhesion plaques 

2 o the peripheral membrane proteins a-actinin and vinculin attach actin filaments to the cell membrane. 

IFs are also attached to membranes by cytoskeletal -membrane anchors. The nuclear lamina is 
attached to the inner surface of the nuclear membrane by the lamin B receptor. Vimentin IFs are 
attached to the plasma membrane by ankyrin and plectin. Desmosome and hemidesmosome 
membrane junctions hold together epithelial cells of organs and skin. These membrane junctions 
25 allow shear forces to be distributed across the entire epithelial cell layer, thus providing strength and 
rigidity to the epithelium. IFs in epithelial cells are attached to the desmosome by plakoglobin and 
desmoplakins. The proteins that link IFs to hemidesmosomes are not known. Desmin IFs surround 
the sarcomere in muscle and are linked to the plasma membrane by paranemin, synemin, and ankyrin. 
Mvosin-related Motor Proteins 

3 o Myosins are actin-activated ATPases, found in eukaryotic cells, that couple hydrolysis of 

ATP with motion. Myosin provides the motor function for muscle contraction and intracellular 
movements such as phagocytosis and rearrangement of cell contents during mitotic cell division 
(cytokinesis). The contractile unit of skeletal muscle, termed the sarcomere, consists of highly ordered 
arrays of thin actin-containing filaments and thick myosin-containing filaments. Crossbridges form 
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between the thick and thin filaments, and the ATP-dependent movement of myosin heads within the 
thick filaments pulls the thin filaments, shortening the sarcomere and thus the muscle fiber. 

Myosins are composed of one or two heavy chains and associated light chains. Myosin 
heavy chains contain an amino-terminal motor or head domain, a neck that is the site of light-chain 
5 binding, and a carboxy-terminal tail domain. The tail domains may associate to form an a -helical 
coiled coil. Conventional myosins, such as those found in muscle tissue, are composed of two 
myosin heavy-chain subunits, each associated with two light-chain subunits that bind at the neck 
region and play a regulatory role. Unconventional myosins, believed to function in intracellular 
motion, may contain either one or two heavy chains and associated light chains. There is evidence for 

10 about 25 myosin heavy chain genes in vertebrates, more than half of them unconventional. 
Dvnein-related Motor Proteins 

Dyneins are (-) end-directed motor proteins which act on microtubules. Two classes of 
dyneins, cytosolic and axonemal, have been identified. Cytosolic dyneins are responsible for 
translocation of materials along cytoplasmic microtubules, for example, transport from the nerve 

is terminal to the cell body and transport of endocytic vesicles to lysosomes. Cytoplasmic dyneins are 

also reported to play a role in mitosis. Axonemal dyneins are responsible for the beating of flagella and 
cilia. Dynein on one microtubule doublet walks along the adjacent microtubule doublet. This sliding 
force produces bending forces that cause the flagellum or cilium to beat. Dyneins have a native mass 
between 1000 and 2000 kDa and contain either two or three force-producing heads driven by the 

2 o hydrolysis of ATP. The heads are linked via stalks to a basal domain which is composed of a highly 
variable number of accessory intermediate and light chains. 
Kinesin-related Motor Proteins 

Kinesins are (+) end-directed motor proteins which act on microtubules. The prototypical 
kinesin molecule is involved in the transport of membrane-bound vesicles and organelles. This function 

25 is particularly important for axonal transport in neurons. Kinesin is also important in all cell types for 
the transport of vesicles from the Golgi complex to the endoplasmic reticulum. This role is critical for 
maintaining the identity and functionality of these secretory organelles. 

Kinesins define a ubiquitous, conserved family of over 50 proteins that can be classified into at 
least 8 subfamilies based on primary amino acid sequence, domain structure, velocity of movement, and 

30 cellular function. (Reviewed in Moore, J.D. and S.A Endow (1996) Bioessays 18:207-219; and Hoyt, 
A.M. (1994) Curr. Opin. Cell Biol. 6:63-68.) The prototypical kinesin molecule is a heterotetramer 
comprised of two heavy polypeptide chains (KHCs) and two light polypeptide chains (KLCs). The 
KHC subunits are typically referred to as "Mnesin." KHC is about 1000 amino acids in length, and 
KLC is about 550 amino acids in length. Two KHCs dimerize to form a rod-shaped molecule with 
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three distinct regions of secondary structure. At one end of the molecule is a globular motor domain 
that functions in ATP hydrolysis and microtubule binding. Kinesin motor domains are highly conserved 
and share over 70% identity. Beyond the motor domain is an a-helical coiled-coil region which 
mediates dimerization. At the other end of the molecule is a fan-shaped tail that associates with 
5 molecular cargo. The tail is formed by the interaction of the KHC C -termini with the two KLCs. 

Members of the more divergent subfamilies of kinesins are called kinesin-related proteins 
(KRPs), many of which function during mitosis in eukaryotes (Hoyt, supraV Some KRPs are required 
for assembly of the mitotic spindle. In vivo and in vitro analyses suggest that these KRPs exert force 
on microtubules that comprise the mitotic spindle, resulting in the separation of spindle poles. 

1 o Phosphorylation of KRP is required for this activity. Failure to assemble the mitotic spindle results in 

abortive mitosis and chromosomal aneuploidy, the latter condition being characteristic of cancer cells. 
In addition, a unique KRP, centromere protein E, localizes to the kinetochore of human mitotic 
chromosomes and may play a role in their segregation to opposite spindle poles. 
Dynamin-related Motor Proteins 
15 Dynamin is a large GTPase motor protein that functions as a "molecular pinchase," 

generating a mechanochemical force used to sever membranes. This activity is important in forming 
clathrin-coated vesicles from coated pits in endocytosis and in the biogenesis of synaptic vesicles in 
neurons. Binding of dynamin to a membrane leads to dynamin* s self-assembly into spirals that may 
act to constrict a flat membrane surface into a tubule. GTP hydrolysis induces a change in 

2 o conformation of the dynamin polymer that pinches the membrane tubule, leading to severing of the 

membrane tubule and formation of a membrane vesicle. Release of GDP and inorganic phosphate 
leads to dynamin disassembly. Following disassembly the dynamin may either dissociate from the 
membrane or remain associated to the vesicle and be transported to another region of the cell. Three 
homologous dynamin genes have been discovered, in addition to several dynamin-related proteins. 

2 5 Conserved dynamin regions are the N-terminal GTP-binding domain, a central pleckstrin homology 
domain that binds membranes, a central coiled-coil region that may activate dynamin' s GTPase 
activity, and a C-terminal proline-rich domain that contains several motifs that bind SH3 domains on 
other proteins. Some dynamin-related proteins do not contain the pleckstrin homology domain or the 
proline-rich domain. (See McNiven, M.A. (1998) Cell 94:151-154; Scaife, R.M. and R.L. Margolis 

30 (1997) Cell. Signal. 9:395-401.) 

The cytoskeleton is reviewed in Lodish, H. et al. (1995) Molecular Cell Biology , Scientific 
American Books, New York NY. 

Ribosomal Molecules 
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SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, and SEQ ID NO:53 
encode, for example, ribosomal molecules. 

Ribosomal RNAs (rRNAs) are assembled, along with ribosomal proteins, into ribosomes, 
which are cytoplasmic particles that translate messenger RNA into polypeptides. The eukaryotic 
5 ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, which together form the 
80S ribosome. In addition to the 18S, 28S, 5S, and 5.8S rRNAs, the ribosome also contains more 
than fifty proteins. The ribosomal proteins have a prefix which denotes the subunit to which they 
belong, either L (large) or S (small). Ribosomal protein activities include binding rRNA and 
organizing the conformation of the junctions between rRNA helices (Woodson, S.A. and N.B. 

10 Leontis (1998) Curr. Opin. Struct Biol. 8:294-300; Ramakrishnan, V. and S.W. White (1998) Trends 
Biochem. Sci. 23:208-212.) Three important sites are identified on the ribosome. The aminoacyl- 
tRNA site (A site) is where charged tRNAs (with the exception of the initiator-tRNA) bind on arrival 
at the ribosome. The peptidyl-tRN A site (P site) is where new peptide bonds are formed, as well as 
where the initiator tRNA binds. The exit site (E site) is where deacylated tRNAs bind prior to their 

15 release from the ribosome. (The ribosome is reviewed in Stryer, L. (1995) Biochemistry W.H. 
Freeman and Company, New York NY, pp. 888-908; and Lodish, H. et al. (1995) Molecular Cell 
Biology Scientific American Books, New York NY. pp. 1 19-138.) 

Chromatin Molecules 

2 0 The nuclear DNA of eukaryotes is organized into chromatin. Two types of chromatin are 

observed: euchromatin, some of which may be transcribed, and heterochromatin so densely packed that 
much of it is inaccessible to transcription. Chromatin packing thus serves to regulate protein 
expression in eukaryotes. Bacteria lack chromatin and the chromatin-packing level of gene regulation. 
The fundamental unit of chromatin is the nucleosome of 200 DNA base pairs associated with 

2 5 two copies each of histones H2A, H2B, H3, and H4. Adjascent nucleosomes are linked by another 

class of histones, HI . Low molecular weight non-histone proteins called the high mobility group 
(HMG), associated with chromatin, may function in the unwinding of DNA and stabilization of single- 
stranded DNA- Chromodomain proteins function in compaction of chromatin into its transcriptionally 
silent heterochromatin form. 

3 o During mitosis, all DNA is compacted into heterochromatin and transcription ceases. 

Transcription in interphase begins with the activation of a region of chromatin. Active chromatin is 
decondensed. Decondensation appears to be accompanied by changes in binding coefficient, 
phosphorylation and acetylation states of chromatin histones. HMG proteins HMG13 and HMG17 
selectively bind activated chromatin. Topoisomerases remove superhelical tension on DNA. The 
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activated region decondenses, allowing gene regulatory proteins and transcription factors to assemble 
on the DN A. 

Patterns of chromatin structure can be stably inherited, producing heritable patterns of gene 
expression. In mammals, one of the two X chromosomes in each female cell is inactivated by 
5 condensation to heterochromatin during zygote development. The inactive state of this chromosome is 
inherited, so that adult females are mosaics of clusters of paternal-X and maternal-X clonal cell groups. 
The condensed X chromosome is reactivated in meiosis. 

Chromatin is associated with disorders of protein expression such as thalassemia, a genetic 
anemia resulting from the removal of the locus control region (LCR) required for decondensation of the 
i o globin gene locus. 

For a review of chromatin structure and function see Alberts, B. et al. (1994) Molecular Cell 
Biology , third edition, Garland Publishing, Inc., New York NY, pp. 351-354, 433^*39. 

Electron Transfer Associated Molecules 

is Electron carriers such as cytochromes accept electrons from NADH or FADR, and donate 

them to other electron carriers. Most electron-transferring proteins, except ubiquinone, are prosthetic 
groups such as flavins, heme, FeS clusters, and copper, bound to inner membrane proteins. 
Adrenodoxin, for example, is an FeS protein that forms a complex with NADPH:adrenodoxin 
reductase and cytochrome p450. Cytochromes contain a heme prosthetic group, a porphyrin ring 

20 containing a tightly bound iron atom. Electron transfer reactions play a crucial role in cellular energy 
production. 

Energy is produced by the oxidation of glucose and fatty acids. Glucose is initially converted 
to pyruvate in the cytoplasm. Fatty acids and pyruvate are transported to the mitochondria for 
complete oxidation to C0 2 coupled by enzymes to the transport of electrons from NADH and FADH 2 

25 to oxygen and to the synthesis of ATP (oxidative phosphorylation) from ADP and P t . 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 
the citric acid cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 

30 transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 

dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 
GTP. In oxidative phosphorylation, the transfer of electrons from NADH and FADH 2 to oxygen by 
dehydrogenases is coupled to the synthesis of ATP from ADP and P s by the F^ ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
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synthesis include the FJF l ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome c x , FeS protein, and cytochrome c oxidase. 

ATP synthesis requires membrane transport enzymes including the phosphate transporter and 
the ATP-ADP antiport protein. The ATP-binding casette (ABC) superfamily has also been suggested 
5 as belonging to the mitochondrial transport group (Hogue, D.L. et al. (1999) J. Mol. Biol. 285:379- 
389). Brown fat uncoupling protein dissipates oxidative energy as heat, and may be involved the fever 
response to infection and trauma (Cannon, B. et al. (1998) Ann. NY Acad. Sci. 856:171-187). 

Mitochondria are oval-shaped organelles comprising an outer membrane, a tightly folded 
inner membrane, an intermembrane space between the outer and inner membranes, and a matrix 
10 inside the inner membrane. The outer membrane contains many porin molecules that allow ions and 
charged molecules to enter the intermembrane space, while the inner membrane contains a variety of 
transport proteins that transfer only selected molecules. Mitochondria are the primary sites of energy 
production in cells. 

Mitochondria contain a small amount of DNA. Human mitochondrial DNA encodes 13 

15 proteins, 22 tRNAs, and 2 rRNAs. Mitochondrial-DNA encoded proteins include NADH-Q 

reductase, a cytochrome reductase subunit, cytochrome oxidase subunits, and ATP synthase subunits. 

Electron-transfer reactions also occur outside the mitochondria in locations such as the 
endoplasmic reticulum, which plays a crucial role in lipid and protein biosynthesis. Cytochrome b5 
is a central electron donor for various reductive reactions occurring on the cytoplasmic surface of liver 

2 o endoplasmic reticulum. Cytochrome b5 has been found in Golgi, plasma, endoplasmic reticulum 
(ER), and microbody membranes. 

For a review of mitochondrial metabolism and regulation, see Lodish, H. et al. (1995) 
Molecular Cell Biology , Scientific American Books, New York NY, pp. 745-797 and Stryer (1995) 
Biochemistry , W.H. Freeman and Co., San Francisco CA, pp 529-558, 988-989. 

2 5 The majority of mitochondrial proteins are encoded by nuclear genes, are synthesized on 

cytosolic ribosomes, and are imported into the mitochondria. Nuclear-encoded proteins which are 
destined for the mitochondrial matrix typically contain positively-charged amino terminal signal 
sequences. Import of these preproteins from the cytoplasm requires a multisubunit protein complex 
in the outer membrane known as the translocase of outer mitochondrial membrane (TOM; previously 

30 designated MOM; Pfanner, N. et al. (1996) Trends Biochem. Sci. 21 :51-52) and at least three inner 
membrane proteins which comprise the translocase of inner mitochondrial membrane (TIM; 
previously designated MIM; Pfanner, supra) . An inside-negative membrane potential across the inner 
mitochondrial membrane is also required for preprotein import. Preproteins are recognized by surface 
receptor components of the TOM complex and are translocated through a proteinaceous pore formed 

35 by other TOM components. Proteins targeted to the matrix are then recognized by the import 
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machinery of the TIM complex. The import systems of the outer and inner membranes can function 
independently (Segui-Real, B. et aL (1993) EMBO J. 12:221 1-2218). 

Once precursor proteins are in the mitochondria, the leader peptide is cleaved by a signal 
peptidase to generate the mature protein. Most leader peptides are removed in a one step process by a 
5 protease termed mitochondrial processing peptidase (MPP) (Paces, V. et al. (1993) Proc. Natl. Acad. 
Sci. USA 90:5355-5358). In some cases a two-step process occurs in which MPP generates an 
intermediate precursor form which is cleaved by a second enzyme, mitochondrial intermediate 
peptidase, to generate the mature protein. 

Mitochondrial dysfunction leads to impaired calcium buffering, generation of free radicals 

10 that may participate in deleterious intracellular and extracellular processes, changes in mitochondrial 
permeability and oxidative damage which is observed in several neurodegenerative diseases. 
Neurodegenerative diseases linked to mitochondrial dysfunction include some forms of Alzheimer's 
disease, Friedreich's ataxia, familial amyotrophic lateral sclerosis, and Huntington's disease (Beal, 
M.F. (1998) Biochim. Biophys. Acta 1366:21 1-213). The myocardium is heavily dependent on 

1 5 oxidative metabolism, so mitochondrial dysfunction often leads to heart disease (DiMauro, S. and M. 
Hirano (1998) Curr. Opin. Cardiol 13:190-197). Mitochondria are implicated in disorders of cell 
proliferation, since they play an important role in a cell's decision to proliferate or self-destruct 
through apoptosis. The oncoprotein Bcl-2, for example, promotes cell proliferation by stabilizing 
mitochondrial membranes so that apoptosis signals are not released (Susin, S.A. (1998) Biochim. 

20 Biophys. Acta 1366:151-165). 

Transcription Factor Molecules 

SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
25 NO:30, SEQ ID NO:31, SEQ ID NO:32, and SEQ ID NO:33 encode, for example, transcription 
factor molecules. 

Multicellular organisms are comprised of diverse cell types that differ dramatically both in 
structure and function. The identity of a cell is determined by its characteristic pattern of gene 
expression, and different cell types express overlapping but distinctive sets of genes throughout 

3 o development. Spatial and temporal regulation of gene expression is critical for the control of cell 
proliferation, cell differentiation, apoptosis, and other processes that contribute to organismal 
development Furthermore, gene expression is regulated in response to extracellular signals that 
mediate cell-cell communication and coordinate the activities of different cell types. Appropriate 
gene regulation also ensures that cells function efficiently by expressing only those genes whose 

3 5 functions are required at a given time. 
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Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
these proteins function as transcription factors that initiate, activate, repress, or terminate gene 
transcription. Transcription factors generally bind to the promoter, enhancer, and upstream regulatory 
regions of a gene in a sequence-specific manner, although some factors bind regulatory elements 
within or downstream of a gene's coding region. Transcription factors may bind to a specific region 
of DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. (1990) Genes 
IV, Oxford University Press, New York NY, and Cell Press, Cambridge MA, pp. 554-570.) 

The double helix structure and repeated sequences of DNA create topological and chemical 
features which can be recognized by transcription factors. These features are hydrogen bond donor 
and acceptor groups, hydrophobic patches, major and minor grooves, and regular, repeated stretches 
of sequence which induce distinct bends in the helix. Typically, transcription factors recognize 
specific DNA sequence motifs of about 20 nucleotides in length. Multiple, adjacent transcription 
factor-binding motifs may be required for gene regulation. 

Many transcription factors incorporate DNA-binding structural motifs which comprise either 
a helices or 6 sheets that bind to the major groove of DNA. Four well-characterized structural motifs 
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 
motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

The helix-turn-helix motif consists of two a helices connected at a fixed angle by a short 
chain of amino acids. One of the helices binds to the major groove. Helix-turn-helix motifs are 
exemplified by the homeobox motif which is present in homeodomain proteins. These proteins are 
critical for specifying the anterior-posterior body axis during development and are conserved 
throughout the animal kingdom. The Antennapedia and Ultrabithorax proteins of Drosophila 
melanogaster are prototypical homeodomain proteins (Pabo, CO. and R.T. Sauer (1992) Annu. Rev. 
Biochem. 61:1053-1095). 

The zinc finger motif, which binds zinc ions, generally contains tandem repeats of about 30 
amino acids consisting of periodically spaced cysteine and histidine residues. Examples of this 
sequence pattern, designated C2H2 and C3HC4 ("RING" finger), have been described (Lewin, supra) . 
Zinc finger proteins each contain an a helix and an antiparallel 8 sheet whose proximity and 
conformation are maintained by the zinc ion. Contact with DNA is made by the arginine prece ding 
the a helix and by the second, third, and sixth residues of the a helix. Variants of the zinc finger 
motif include poorly defined cysteine-rich motifs which bind zinc or other metal ions. These motifs 
may not contain histidine residues and are generally nonrepetitive. 

The leucine zipper motif comprises a stretch of amino acids rich in leucine which can form an 
amphipathic a helix. This structure provides the basis for dimerization of two leucine zipper 
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proteins. The region adjacent to the leucine zipper is usually basic, and upon protein dimerization, is 
optimally positioned for binding to the major groove. Proteins containing such motifs are generally 
referred to as bZIP transcription factors. 

The helix-loop-helix motif (HLH) consists of a short a helix connected by a loop to a longer 
5 a helix. The loop is flexible and allows the two helices to fold back against each other and to bind to 
DNA. The transcription factor Myc contains a prototypical HLH motif. 

Most transcription factors contain characteristic DNA binding motifs, and variations on the 
above motifs and new motifs have been and are currently being characterized (Faisst, S. and S. Meyer 
(1992) Nucleic Acids Res. 20:3-26). 

10 Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 

Malignant cell growth may result from either excessive expression of tumor promoting genes or 
insufficient expression of tumor suppressor genes (Cleary, M.L. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fiise the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in 

15 inappropriate gene transcription, potentially contributing to malignancy. 

In addition, the immune system responds to infection or trauma by activating a cascade of 
events that coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 

2 o regulation of gene expression may result in considerable tissue or organ damage. This damage is well 
documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and 
infections (Isselbacher, KJ. et al. (1996) Harrison's Principles of Internal Medicine . 13/e, McGraw 
Hill, Inc. and Teton Data Systems Software). 

Furthermore, the generation of multicellular organisms is based upon the induction and 

25 coordination of cell differentiation at the appropriate stages of development Central to this process is 
differential gene expression, which confers the distinct identities of cells and tissues throughout the 
body. Failure to regulate gene expression during development can result in developmental disorders. 
Human developmental disorders caused by mutations in zinc finger-type transcriptional regulators 
include: urogenenital developmental abnormalities associated with WT1 ; Greig 

30 cephalopolysyndactyly, Pallister-Hall syndrome, and postaxial Polydactyly type A (GLI3); and 
Townes-Brocks syndrome, characterized by anal, renal, limb, and ear abnormalities (SALL1) 
(Engelkamp, D. and V. van Heyningen (1996) Curr. Opin. Genet. Dev. 6:334-342; Kohlhase, J. et al. 
(1999) Am. J. Hum. Genet. 64:435-445). 
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Cell Membrane Molecules 

SEQ ID NO:46, SEQ ID NO:47, and SEQ ID NO:48 encode, for example, cell membrane 
molecules. 

Eukaryotic cells are surrounded by plasma membranes which enclose the cell and maintain an 
5 environment inside the cell that is distinct from its surroundings. In addition, eukaryotic organisms 
are distinct from prokaryotes in possessing many intracellular organelle and vesicle structures. Many 
of the metabolic reactions which distinguish eukaryotic biochemistry from prokaryotic biochemistry 
take place within these structures. The plasma membrane and the membranes surrounding organelles 
and vesicles are composed of phosphoglycerides, fatty acids, cholesterol, phospholipids, glycolipids, 

10 proteoglycans, and proteins. These components confer identity and functionality to the membranes 
with which they associate. 
Integral Membrane Proteins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 
are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 

is typically comprised of 15 to 25 hydrophobic amino acids which are predicted to adopt an a -helical 
conformation. TM proteins are classified as bitopic (Types I and II) and polytopic (Types III and IV) 
(Singer, S.J. (1990) Annu. Rev. Cell Biol. 6:247-296). Bitopic proteins span the membrane once 
while polytopic proteins contain multiple membrane-spanning segments. TM proteins function as 
cell-surface receptors, receptor-interacting proteins, transporters of ions or metabolites, ion channels, 

20 cell anchoring proteins, and cell type-specific surface antigens. 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, RGD, NGR, 
and GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-like domains. RGD, 
NGR, and GSL motif-containing peptides have been used as drug delivery agents in targeted cancer 

25 treatment of tumor vasculature (Arap, W. et al. (1998) Science 279:377-380). Furthermore, MPs may 
also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), that 
mediate interactions with extracellular or intracellular molecules. 
G-Protein Coupled Receptors 

G-protein coupled receptors (GPCR) are a superfamily of integral membrane proteins which 

3 o transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid mediators of 
inflammation, peptide hormones, and sensory signal mediators. The structure of these 
highly-conserved receptors consists of seven hydrophobic transmembrane regions, an extracellular 
N-terminus, and a cytoplasmic C -terminus. Three extracellular loops alternate with three intracellular 
loops to link the seven transmembrane regions. Cysteine disulfide bridges connect the second and 

3 5 third extracellular loops. The most conserved regions of GPCRs are the transmembrane regions and 
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the first two cytoplasmic loops. A conserved, acidic- Arg-aromatic residue triplet present in the 
second cytoplasmic loop may interact with G proteins. A GPCR consensus pattern is characteristic of 
most proteins belonging to this superfamily (ExPASy PROSITE document PS00237; and Watson, S. 
and S. Arkinstall (1994) The G-protein Linked Receptor Facts Book . Academic Press, San Diego CA, 
5 pp. 2-6). Mutations and changes in transcriptional activation of GPCR-encoding genes have been 
associated with neurological disorders such as schizophrenia, Parkinson's disease, Alzheimer's 
disease, drug addiction, and feeding disorders. 
Scavenger Receptors 

Macrophage scavenger receptors with broad ligand specificity may participate in the binding 

10 of low density lipoproteins (LDL) and foreign antigens. Scavenger receptors types I and II are 

trimeric membrane proteins with each subunit containing a small N-terminal intracellular domain, a 
transmembrane domain, a large extracellular domain, and a C-terminal cysteine-rich domain. The 
extracellular domain contains a short spacer region, an a-helical coiled-coil region, and a triple helical 
collagen-like region. These receptors have been shown to bind a spectrum of ligands, including 

is chemically modified lipoproteins and albumin, polyribonucleotides, polysaccharides, phospholipids, 
and asbestos (Matsumoto, A. et al. (1990) Proc. Natl. Acad. Sci. USA 87:9133-9137; and Elomaa, O. 
et al. (1995) Cell 80:603-609). The scavenger receptors are thought to play a key role in 
atherogenesis by mediating uptake of modified LDL in arterial walls, and in host defense by binding 
bacterial endotoxins, bacteria, and protozoa. 

20 Tetraspan Family Proteins 

The transmembrane 4 superfamily (TM4SF) or tetraspan family is a multigene family 
encoding type III integral membrane proteins (Wright, M.D. and M.G. Tomlinson (1994) Immunol. 
Today 15:588-594). The TM4SF is comprised of membrane proteins which traverse the cell 
membrane four times. Members of the TM4SF include platelet and endothelial cell membrane 

25 proteins, melanoma-associated antigens, leukocyte surface glycoproteins, colonal carcinoma antigens, 
tumor-associated antigens, and surface proteins of the schistosome parasites (Jankowski, S.A. (1994) 
Oncogene 9:1205-1211). Members of the TM4SF share about 25-30% amino acid sequence identity 
with one another. 

A number of TM4SF members have been implicated in signal transduction, control of cell 
30 adhesion, regulation of cell growth and proliferation, including development and oncogenesis, and 
cell motility, including tumor cell metastasis. Expression of TM4SF proteins is associated with a 
variety of tumors and the level of expression may be altered when cells are growing or activated. 
Tumor Antigens 

Tumor antigens are cell surface molecules that are differentially expressed in tumor cells 
35 relative to normal cells. Tumor antigens distinguish tumor cells immunologically from normal cells 
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and provide diagnostic and therapeutic targets for human cancers (Takagi, S. et al. (1995) Int. J. 
Cancer 61:706-715; Liu, E. et al. (1992) Oncogene 7:1027-1032). 
Leukocyte Antigens 

Other types of cell surface antigens include those identified on leukocytic cells of the immune 
system. These antigens have been identified using systematic, monoclonal antibody (mAb)-based 
"shot gun" techniques. These techniques have resulted in the production of hundreds of mAbs 
directed against unknown cell surface leukocytic antigens. These antigens have been grouped into 
"clusters of differentiation" based on common immunocytochemical localization patterns in various 
differentiated and undifferentiated leukocytic cell types. Antigens in a given cluster are presumed to 
identify a single cell surface protein and are assigned a "cluster of differentiation" or "CD" 
designation. Some of the genes encoding proteins identified by CD antigens have been cloned and 
verified by standard molecular biology techniques. CD antigens have been characterized as both 
transmembrane proteins and cell surface proteins anchored to the plasma membrane via covalent 
attachment to fatty acid-containing glycolipids such as glycosylphosphatidylinositol (GPI). 
(Reviewed in Barclay, A.N. et al. (1995) The Leucocyte Antigen Facts Book . Academic Press, San 
Diego CA, pp. 17-20.) 
Ion Channels 

Ion channels are found in the plasma membranes of virtually every cell in the body. For 
example, chloride channels mediate a variety of cellular functions including regulation of membrane 
potentials and absorption and secretion of ions across epithelial membranes. Chloride channels also 
regulate the pH of organelles such as the Golgi apparatus and endosomes (see, e.g., Greger, R. (1988) 
Annu. Rev. Physiol. 50:111-122). Electrophysiological and pharmacological properties of chloride 
channels, including ion conductance, current-voltage relationships, and sensitivity to modulators, 
suggest that different chloride channels exist in muscles, neurons, fibroblasts, epithelial cells, and 
lymphocytes. 

Many ion channels have sites for phosphorylation by one or more protein kinases including 
protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of which regulate ion 
channel activity in cells. Inappropriate phosphorylation of proteins in cells has been linked to 
changes in cell cycle progression and cell differentiation. Changes in the cell cycle have been linked 
to induction of apoptosis or cancer. Changes in cell differentiation have been linked to diseases and 
disorders of the reproductive system, immune system, skeletal muscle, and other organ systems. 
Proton Pumps 

Proton ATPases comprise a large class of membrane proteins that use the energy of ATP 
hydrolysis to generate an electrochemical proton gradient across a membrane. The resultant gradient 
may be used to transport other ions across the membrane (Na + , K + , or CI*) or to maintain organelle 
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pH. Proton ATPases are further subdivided into the mitochondrial F- ATPases, the plasma membrane 
ATPases, and the vacuolar ATPases. The vacuolar ATPases establish and maintain an acidic pH 
within various organelles involved in the processes of endocytosis and exocytosis (Mellman, I. et al. 
(1986) Annu. Rev. Biochem. 55:663-700). 

Proton-coupled, 12 membrane-spanning domain transporters such as PEPT 1 and PEPT 2 are 
responsible for gastrointestinal absorption and for renal reabsorption of peptides using an 
electrochemical H* gradient as the driving force. Another type of peptide transporter, the TAP 
transporter, is a heterodimer consisting of TAP 1 and TAP 2 and is associated with antigen 
processing. Peptide antigens are transported across the membrane of the endoplasmic reticulum by 
TAP so they can be expressed on the cell surface in association with MHC molecules. Each TAP 
protein consists of multiple hydrophobic membrane spanning segments and a highly conserved 
ATP-binding cassette (Boll, M. et al. (1996) Proc. Natl. Acad. Sci. USA 93:284-289). Pathogenic 
microorganisms, such as herpes simplex virus, may encode inhibitors of TAP-mediated peptide 
transport in order to evade immune surveillance (Marusina, K. and J.J Manaco (1996) Curr. Opin. 
Hematol. 3:19-26). 
ABC Transporters 

The ATP-binding cassette (ABC) transporters, also called the "traffic ATPases", comprise a 
superf amily of membrane proteins that mediate transport and channel functions in prokaryotes and 
eukaryotes (Higgins, C.F. (1992) Annu. Rev. Cell Biol. 8:67-113). ABC proteins share a similar 
overall structure and significant sequence homology. All ABC proteins contain a conserved domain 
of approximately two hundred amino acid residues which includes one or more nucleotide binding 
domains. Mutations in ABC transporter genes are associated with various disorders, such as 
hyperbilirubinemia II/Dubin- Johnson syndrome, recessive Stargardt's disease, X-linked 
adrenoleukodystrophy, multidrug resistance, celiac disease, and cystic fibrosis. 
Peripheral and Anchored Membrane Proteins 

Some membrane proteins are not membrane-spanning but are attached to the plasma 
membrane via membrane anchors or interactions with integral membrane proteins. Membrane 
anchors are covalently joined to a protein post-translationally and include such moieties as prenyl, 
myristyl, and glycosylphosphatidyl inositol groups. Membrane localization of peripheral and 
anchored proteins is important for their function in processes such as receptor-mediated signal 
transduction. For example, prenylation of Ras is required for its localization to the plasma membrane 
and for its normal and oncogenic functions in signal transduction. 
Vesicle Coat Proteins 

Intercellular communication is essential for the development and survival of multicellular 
organisms. Cells communicate with one another through the secretion and uptake of protein 
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signaling molecules. The uptake of proteins into the cell is achieved by the endocytic pathway, in 
which the interaction of extracellular signaling molecules with plasma membrane receptors results in 
the formation of plasma membrane-derived vesicles that enclose and transport the molecules into the 
cytosol. These transport vesicles fiise with and mature into endosomal and lysosomal (digestive) 
5 compartments. The secretion of proteins from the cell is achieved by exocytosis, in which molecules 
inside of the cell proceed through the secretory pathway. In this pathway, molecules transit from the 
ER to the Golgi apparatus and finally to the plasma membrane, where they are secreted from the cell. 

Several steps in the transit of material along the secretory and endocytic pathways require the 
formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic reticulum 

10 (tER), the rim of Golgi cisternae, the face of the Trans-Golgi Network (TGN), the plasma membrane 
(PM), and tubular extensions of the endosomes. Vesicle formation occurs when a region of 
membrane buds off from the donor organelle. The membrane-bound vesicle contains proteins to be 
transported and is surrounded by a proteinaceous coat, the components of which are recruited from 
the cytosol. Two different classes of coat protein have been identified. Clathrin coats form on 

15 vesicles derived from the TGN and PM, whereas coatomer (COP) coats form on vesicles derived 
from the ER and Golgi. COP coats can be further classified as COPI, involved in retrograde traffic 
through the Golgi and from the Golgi to the ER, and COPII, involved in anterograde traffic from the 
ER to the Golgi (Mellman, supra! 

In clathrin-based vesicle formation, adapter proteins bring vesicle cargo and coat proteins 

2 o together at the surface of the budding membrane. Adapter protein- 1 and -2 select cargo from the 
TGN and plasma membrane, respectively, based on molecular information encoded on the 
cytoplasmic tail of integral membrane cargo proteins. Adapter proteins also recruit clathrin to the bud 
site. Clathrin is a protein complex consisting of three large and three small polypeptide chains 
arranged in a three-legged structure called a triskelion. Multiple triskelions and other coat proteins 

2 5 appear to self -assemble on the membrane to form a coated pit. This assembly process may serve to 

deform the membrane into a budding vesicle. GTP-bound ADP-ribosylation factor ( Arf) is also 
incorporated into the coated assembly. Another small G-protein, dynamin, forms a ring complex 
around the neck of the forming vesicle and may provide the mechanochemical force to seal the bud, 
thereby releasing the vesicle. The coated vesicle complex is then transported through the cytosol. 

3 o During the transport process, Arf-bound GTP is hydrolyzed to GDP, and the coat dissociates from the 

transport vesicle (West, M.A. et al. (1997) J. Cell Biol. 138:1239-1254). 

Vesicles which bud from the ER and the Golgi are covered with a protein coat similar to the 
clathrin coat of endocytic and TGN vesicles. The coat protein (COP) is assembled from cytosolic 
precursor molecules at specific budding regions on the organelle. The COP coat consists of two 
3 5 major components, a G-protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar 
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complex of seven proteins, termed alpha-, beta-, beta'-, gamma-, delta-, epsilon- and zeta-COP. The 
coatomer complex binds to dilysine motifs contained on the cytoplasmic tails of integral membrane 
proteins. These include the KKXX retrieval motif of membrane proteins of the ER and 
dibasic/diphenylamine motifs of members of the p24 family. Hie p24 family of type I membrane 
5 proteins represent the major membrane proteins of COPI vesicles (Harter, C. and F.T. Wieland (1998) 
Proc. Natl. Acad. Sci. USA 95:11649-11654). 

Organelle Associated Molecules 

SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID 
10 NO:59, SEQ ID NO:60, SEQ ID NO:61 , SEQ ID NO:62, and SEQ ID NO:63 encode, for example, 

organelle associated molecules. 

Eukaryotic cells are organized into various cellular organelles which has the effect of 

separating specific molecules and their functions from one another and from the cytosoL Within the 

cell, various membrane structures surround and define these organelles while allowing them to 
15 interact with one another and the cell environment through both active and passive transport 

processes. Important cell organelles include the nucleus, the Golgi apparatus, the endoplasmic 

reticulum, mitochondria, peroxisomes, lysosomes, endosomes, and secretory vesicles. 

Nucleus 

The cell nucleus contains all of the genetic information of the cell in the form of DNA, and 
20 the components and machinery necessary for replication of DNA and for transcription of DNA into 
RNA. (See Alberts, B. et al. (1994) Molecular Biology of the Cell , Garland Publishing Inc., New 
York NY, pp. 335-399.) DNA is organized into compact structures in the nucleus by interactions 
with various DNA-binding proteins such as histones and non-histone chromosomal proteins. 
DNA-specific nucleases, DNAses, partially degrade these compacted structures prior to DNA 

2 5 replication or transcription. DNA replication takes place with the aid of DNA helicases which 

unwind the double-stranded DNA helix, and DNA polymerases that duplicate the separated DNA 
strands. 

Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
these proteins function as transcription factors that initiate, activate, repress, or terminate gene 

3 o transcription. Transcription factors generally bind to the promoter, enhancer, and upstream 

regulatory regions of a gene in a sequence-specific manner, although some factors bind regulatory 
elements within or downstream of a gene's coding region. Transcription factors may bind to a specific 
region of DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. (1990) 
Genes IV , Oxford University Press, New York NY, and Cell Press, Cambridge MA, pp. 554-570.) 
35 Many transcription factors incorporate DNA-binding structural motifs which comprise either a 
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helices or B sheets that bind to the major groove of DN A. Four well-characterized structural motifs 
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these 
motifs may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 
Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
5 Malignant cell growth may result from either excessive expression of tumor promoting genes or 
insufficient expression of tumor suppressor genes (Cleary, MX. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in 
inappropriate gene transcription, potentially contributing to malignancy. 

10 In addition, the immune system responds to infection or trauma by activating a cascade of 

events that coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is well 

15 documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and 
infections (Isselbacher, K.J. et al. (1996) Harrison's Principles of Internal Medicine . 13/e, McGraw 
Hill, Inc. and Teton Data Systems Software). 

Transcription of DNA into RNA also takes place in the nucleus catalyzed by RNA 
polymerases. Three types of RNA polymerase exist. RNA polymerase I makes large ribosomal 

2 o RNAs, while RNA polymerase III makes a variety of small, stable RNAs including 5S ribosomal 
RNA and the transfer RNAs (tRNA). RNA polymerase II transcribes genes that will be translated 
into proteins. The primary transcript of RNA polymerase II is called heterogenous nuclear RNA 
(hnRNA), and must be further processed by splicing to remove non-coding sequences called introns. 
RNA splicing is mediated by small nuclear ribonucleoprotein complexes, or snRNPs, producing 

2 5 mature messenger RNA (mRNA) which is then transported out of the nucleus for translation into 

proteins. 
Nucleolus 

The nucleolus is a highly organized subcompartment in the nucleus that contains high 
concentrations of RNA and proteins and functions mainly in ribosomal RNA synthesis and assembly 
30 (Alberts, et al. supra , pp. 379-382). Ribosomal RNA (rRNA) is a structural RNA that is complexed 
with proteins to form ribonucleoprotein structures called ribosomes. Ribosomes provide the platform 
on which protein synthesis takes place. 

Ribosomes are assembled in the nucleolus initially from a large, 45S rRNA combined with a 
variety of proteins imported from the cytoplasm, as well as smaller, 5S rRNAs. Later processing of 

3 5 the immature ribosome results in formation of smaller ribosomal subunits which are transported from 
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the nucleolus to the cytoplasm where they are assembled into functional ribosomes. 
Endoplasmic Reticulum 

In eukaryotes, proteins are synthesized within the endoplasmic reticulum (ER), delivered from 
the ER to the Golgi apparatus for post-translational processing and sorting, and transported from the 
5 Golgi to specific intracellular and extracellular destinations. Synthesis of integral membrane proteins, 
secreted proteins, and proteins destined for the lumen of a particular organelle occurs on the rough 
endoplasmic reticulum (ER). The rough ER is so named because of the rough appearance in electron 
micrographs imparted by the attached ribosomes on which protein synthesis proceeds. Synthesis of 
proteins destined for the ER actually begins in the cytosol with the synthesis of a specific signal 

10 peptide which directs the growing polypeptide and its attached ribosome to the ER membrane where 
the signal peptide is removed and protein synthesis is completed. Soluble proteins destined for the 
ER lumen, for secretion, or for transport to the lumen of other organelles pass completely into the ER 
lumen. Transmembrane proteins destined for the ER or for other cell membranes are translocated 
across the ER membrane but remain anchored in the lipid bilayer of the membrane by one or more 

is membrane-spanning oc-helical regions. 

Translocated polypeptide chains destined for other organelles or for secretion also fold and 
assemble in the ER lumen with the aid of certain "resident" ER proteins. Protein folding in the ER is 
aided by two principal types of protein isomerases, protein disulfide isomerase (PDI), and peptidyl- 
prolyl isomerase (PPI). PE>I catalyzes the oxidation of free sulfhydryl groups in cysteine residues to 

20 form intramolecular disulfide bonds in proteins. PPI, an enzyme that catalyzes the isomerization of 
certain proline imide bonds in oligopeptides and proteins, is considered to govern one of the rate 
limiting steps in the folding of many proteins to their final functional conformation. The cyclophilins 
represent a major class of PPI that was originally identified as the major receptor for the 
immunosuppressive drug cyclosporin A (Handschumacher, R.E. et al. (1984) Science 226:544-547). 

25 Molecular "chaperones" such as BiP (binding protein) in the ER recognize incorrectly folded proteins 
as well as proteins not yet folded into their final form and bind to them, both to prevent improper 
aggregation between them, and to promote proper folding. 

The "N-linked" glycosylation of most soluble secreted and membrane-bound proteins by 
oligosacchrides linked to asparagine residues in proteins is also performed in the ER. This reaction is 

30 catalyzed by a membrane-bound enzyme, oligosaccharyl transferase. 
Golgi Apparatus 

The Golgi apparatus is a complex structure that lies adjacent to the ER in eukaryotic cells and 
serves primarily as a sorting and dispatching station for products of the ER (Alberts, et al. supra , pp. 
600-610). Additional posttranslational processing, principally additional glycosylation, also occurs in 
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the Golgi. Indeed, the Golgi is a major site of carbohydrate synthesis, including most of the 
glycosaminoglycans of the extracellular matrix. N-linked oligosaccharides, added to proteins in the 
ER, are also further modified in the Golgi by the addition of more sugar residues to form complex N- 
linked oligosaccharides. "O-linked" glycosylation of proteins also occurs in the Golgi by the 
5 addition of N-acetylgalactosamine to the hydroxyl group of a serine or threonine residue followed by 
the sequential addition of other sugar residues to the first. This process is catalyzed by a series of 
glycosyltransferases each specific for a particular donor sugar nucleotide and acceptor molecule 
(Lodish, H. et al. (1995) Molecular Cell Biology , W.H. Freeman and Co., New York NY, pp.700- 
708). In many cases, both N- and O-linked oligosaccharides appear to be required for the secretion of 

10 proteins or the movement of plasma membrane glycoproteins to the cell surface. 

The terminal compartment of the Golgi is the Trans-Golgi Network (TGN), where both 
membrane and lumenal proteins are sorted for their final destination. Transport (or secretory) vesicles 
destined for intracellular compartments, such as lysosomes, bud off of the TGN. Other transport 
vesicles bud off containing proteins destined for the plasma membrane, such as receptors, adhesion 

15 molecules, and ion channels, and secretory proteins, such as hormones, neurotransmitters, and digestive 
enzymes. 
Vacuoles 

The vacuole system is a collection of membrane bound compartments in eukaryotic cells that 
functions in the processes of endocytosis and exocytosis. They include phagosomes, lysosomes, 
20 endosomes, and secretory vesicles. Endocytosis is the process in cells of internalizing nutrients, solutes 
or small particles (pinocytosis) or large particles such as internalized receptors, viruses, bacteria, or 
bacterial toxins (phagocytosis). Exocytosis is the process of transporting molecules to the cell surface. 
It facilitates placement or localization of membrane-bound receptors or other membrane proteins and 
secretion of hormones, neurotransmitters, digestive enzymes, wastes, etc. 

2 5 A common property of all of these vacuoles is an acidic pH environment ranging from 

approximately pH 4.5-5.0. This acidity is maintained by the presence of a proton ATPase that uses the 
energy of ATP hydrolysis to generate an electrochemical proton gradient across a membrane (Mellman, 
L et al. (1986) Annu. Rev. Biochem. 55:663-700). Eukaryotic vacuolar proton ATPase (vp-ATPase) is 
a multimeric enzyme composed of 3-10 different subunits. One of these subunits is a highly 

3 o hydrophobic polypeptide of approximately 1 6 kDa that is similar to the proteolipid component of vp- 

ATPases from eubacteria, fungi, and plant vacuoles (Mandel, M. et al. (1988) Proc. Natl. Acad. Sci. 
USA 85:5521-5524). The 16 kDa proteolipid component is the major subunit of the membrane portion 
of vp-ATPase and functions in the transport of protons across the membrane. 
Lysosomes 
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Lysosomes are membranous vesicles containing various hydrolytic enzymes used for the 
controlled intracellular digestion of macromolecules. Lysosomes contain some 40 types of enzymes 
including proteases, nucleases, glycosidases, lipases, phospholipases, phosphatases, and sulfatases, all 
of which are acid hydrolases that function at a pH of about 5. Lysosomes are surrounded by a unique 
5 membrane containing transport proteins that allow the final products of macromolecule degradation, 
such as sugars, amino acids, and nucleotides, to be transported to the cytosol where they may be 
either excreted or reutilized by the cell. A vp-ATPase, such as that described above, maintains the 
acidic environment necessary for hydrolytic activity (Alberts, supra, pp. 610-61 1). 
Endosomes 

10 Endosomes are another type of acidic vacuole that is used to transport substances from the 

cell surface to the interior of the cell in the process of endocytosis. Like lysosomes, endosomes have 
an acidic environment provided by a vp-ATPase (Alberts et al. supra, pp. 610-618). Two types of 
endosomes are apparent based on tracer uptake studies that distinguish their time of formation in the 
cell and their cellular location. Early endosomes are found near the plasma membrane and appear to 

1 5 function primarily in the recycling of internalized receptors back to the cell surface. Late endosomes 
appear later in the endocytic process close to the Golgi apparatus and the nucleus, and appear to be 
associated with delivery of endocytosed material to lysosomes or to the TGN where they may be 
recycled. Specific proteins are associated with particular transport vesicles and their target 
compartments that may provide selectivity in targeting vesicles to their proper compartments. A 

20 cytosolic prenylated GTP-binding protein, Rab, is one such protein. Rabs 4, 5, and 1 1 are associated 
with the early endosome, whereas Rabs 7 and 9 associate with the late endosome. 
Mitochondria 

Mitochondria are oval-shaped organelles comprising an outer membrane, a tightly folded 
inner membrane, an intermembrane space between the outer and inner membranes, and a matrix 
25 inside the inner membrane. The outer membrane contains many porin molecules that allow ions and 
charged molecules to enter the intermembrane space, while the inner membrane contains a variety of 
transport proteins that transfer only selected molecules. Mitochondria are the primary sites of energy 
production in cells. 

Energy is produced by the oxidation of glucose and fatty acids. Glucose is initially converted 
3 0 to pyruvate in the cytoplasm. Fatty acids and pyruvate are transported to the mitochondria for 

complete oxidation to C0 2 coupled by enzymes to the transport of electrons from NADH and FADH 2 
to oxygen and to the synthesis of ATP (oxidative phosphorylation) from ADP and P £ . 

Pyruvate is transported into the mitochondria and converted to acetyl-CoA for oxidation via 
the citric acid cycle, involving pyruvate dehydrogenase components, dihydrolipoyl transacetylase, and 
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dihydrolipoyl dehydrogenase. Enzymes involved in the citric acid cycle include: citrate synthetase, 
aconitases, isocitrate dehydrogenase, alpha-ketoglutarate dehydrogenase complex including 
transsuccinylases, succinyl CoA synthetase, succinate dehydrogenase, fumarases, and malate 
dehydrogenase. Acetyl CoA is oxidized to C0 2 with concomitant formation of NADH, FADH 2 , and 
5 GTP. In oxidative phosphorylation, the transfer of electrons from NADH and FADH 2 to oxygen by 
dehydrogenases is coupled to the synthesis of ATP from ADP and Pj by the FqFj ATPase complex in 
the mitochondrial inner membrane. Enzyme complexes responsible for electron transport and ATP 
synthesis include the ATPase complex, ubiquinone(CoQ)-cytochrome c reductase, ubiquinone 
reductase, cytochrome b, cytochrome c v FeS protein, and cytochrome c oxidase. 
10 Peroxisomes 

Peroxisomes, like mitochondria, are a major site of oxygen utilization. They contain one or 
more enzymes, such as catalase and urate oxidase, that use molecular oxygen to remove hydrogen 
atoms from specific organic substrates in an oxidative reaction that produces hydrogen peroxide 
(Alberts, supra , pp. 574-577). Catalase oxidizes a variety of substrates including phenols, formic 

15 acid, formaldehyde, and alcohol and is important in peroxisomes of liver and kidney cells for 

detoxifying various toxic molecules that enter the bloodstream. Another major function of oxidative 
reactions in peroxisomes is the breakdown of fatty acids in a process called P oxidation. P oxidation 
results in shortening of the alkyl chain of fatty acids by blocks of two carbon atoms that are converted 
to acetyl CoA and exported to the cytosol for reuse in biosynthetic reactions. 

20 Also like mitochondria, peroxisomes import their proteins from the cytosol using a specific 

signal sequence located near the C-terminus of the protein. The importance of this import process is 
evident in the inherited human disease Zellweger syndrome, in which a defect in importing proteins 
into perixosomes leads to a perixosomal deficiency resulting in severe abnormalities in the brain, 
liver, and kidneys, and death soon after birth. One form of this disease has been shown to be due to a 

25 mutation in the gene encoding a perixosomal integral membrane protein called peroxisome assembly 
factor-1. 

The discovery of new human molecules satisfies a need in the art by providing new 
compositions which are useful in the diagnosis, study, prevention, and treatment of diseases associated 
with, as well as effects of exogenous compounds on, the expression of human molecules. 



30 



SUMMARY OF THE INVENTION 

The present invention relates to nucleic acid sequences comprising human diagnostic and 
therapeutic polynucleotides (dithp) as presented in the Sequence Listing. Some of the dithp uniquely 
identify genes encoding human structural, functional, and regulatory molecules. 
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The invention provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO: 1-71 ; b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71; c) a 
5 polynucleotide sequence complementary to a); d) a polynucleotide sequence complementary to b); and 
e) an RNA equivalent of a) through d). In one alternative, the polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71. In another alternative, 
the polynucleotide comprises at least 60 contiguous nucleotides of a polynucleotide sequence selected 
from the group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID 

10 NO: 1-71; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71; c) a polynucleotide 
sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA 
equivalent of a) through d). The invention further provides a composition for the detection of 
expression of human diagnostic and therapeutic polynucleotides, comprising at least one isolated 

15 polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-71 ; c) a polynucleotide sequence complementary to 
a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d); and a 

2 o detectable label. 

The invention also provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide comprising a polynucleotide sequence selected from the group consisting of a) a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71 ; b) a naturally 
occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence 

25 selected from the group consisting of SEQ ID NO:l-71 ; c) a polynucleotide sequence complementary to 
a); d) a polynucleotide sequence complementary to b); and e) an RNA equivalent of a) through d). The 
method comprises a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 

30 is formed between said probe and said target polynucleotide, and b) detecting the presence or absence of 
said hybridization complex, and, optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 30 contiguous nucleotides. In another alternative, the probe comprises at least 60 
contiguous nucleotides. 

The invention further provides a recombinant polynucleotide comprising a promoter sequence 
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operably linked to an isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
71 ; b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71 ; c) a polynucleotide 
5 sequence complementary to a); d) a polynucleotide sequence complementary to b); and e) an RNA 
equivalent of a) through d). In one alternative, the invention provides a cell transformed with the 
recombinant polynucleotide. In another alto-native, the invention provides a transgenic organism 
comprising the recombinant polynucleotide. In a further alternative, the invention provides a method 
for producing a human diagnostic and therapeutic polypeptide, the method comprising a) culturing a 

10 cell undo: conditions suitable for expression of the human diagnostic and therapeutic polypeptide, 
wherein said cell is transformed with the recombinant polynucleotide, and b) recovering the human 
diagnostic and therapeutic polypeptide so expressed. 

The invention also provides a purified human diagnostic and therapeutic polypeptide (DITHP) 
encoded by at least one polynucleotide comprising a polynucleotide sequence selected from the group 

1 5 consisting of SEQ ID NO:l -71 . Additionally, the invention provides an isolated antibody which 

specifically binds to the human diagnostic and therapeutic polypeptide. The invention further provides 
a method of identifying a test compound which specifically binds to the human diagnostic and 
therapeutic polypeptide, the method comprising the steps of a) providing a test compound; b) combining 
the human diagnostic and therapeutic polypeptide with the test compound for a sufficient time and 

20 under suitable conditions for binding; and c) detecting binding of the human diagnostic and therapeutic 
polypeptide to the test compound, thereby identifying the test compound which specifically binds the 
human diagnostic and therapeutic polypeptide. 

The invention further provides a microarray wherein at least one element of the microarray is 
an isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide comprising 

25 a polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 1-71; b) a naturally occurring polynucleotide sequence having 
at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:l-71; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 
complementary to b); and e) an RNA equivalent of a) through d). The invention also provides a method 

30 for generating a transcript image of a sample which contains polynucleotides. The method comprises a) 
labeling the polynucleotides of the sample, b) contacting the elements of the microarray with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a hybridization complex, 
and c) quantifying the expression of the polynucleotides in the sample. 

Additionally, the invention provides a method for screening a compound for effectiveness in 
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altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of a) a polynucleotide sequence selected 
from the group consisting of SEQ ID NO: 1-71; b) a naturally occurring polynucleotide sequence having 
at least 90% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ 
5 ID NO: 1-71 ; c) a polynucleotide sequence complementary to a); d) a polynucleotide sequence 

complementary to b); and e) an RNA equivalent of a) through d). The method comprises a) exposing a 
sample comprising the target polynucleotide to a compound, and b) detecting altered expression of the 
target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said method 
10 comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:l- 
71 ; ii) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
15 polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71; iii) a polynucleotide 
sequence complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA 
equivalent of i)-iv). Hybridization occurs under conditions whereby a specific hybridization complex 
is formed between said probe and a target polynucleotide in the biological sample, said target 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of i) a 

2 o polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 -71 ; ii) a naturally 

occurring polynucleotide sequence having at least 90% sequence identity to a polynucleotide 
sequence selected from the group consisting of SEQ ID NO:l-71; iii) a polynucleotide sequence 
complementary to i), iv) a polynucleotide sequence complementary to ii), and v) an RNA equivalent 
of i)-iv), and alternatively, the target polynucleotide comprises a fragment of a polynucleotide sequence 
25 selected from the group consisting of i-v above; c) quantifying the amount of hybridization complex; 
and d) comparing the amount of hybridization complex in the treated biological sample with the amount 
of hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is indicative of toxicity of the test compound- 

3 0 DESCRIPTION OF THE TABLES 

Table 1 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with their 
GenBank hits (GI Numbers), probability scores, and functional annotations corresponding to the 
Gen Bank hits. 
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Table 2 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 
nucleotide positions. The reading frames of the polynucleotide segments and the Pfam hits, Pfam 
5 descriptions, and E-values corresponding to the polypeptide domains encoded by the polynucleotide 
segments are indicated. 

Table 3 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
polynucleotide segments of each template sequence as defined by the indicated "start" and "stop" 
10 nucleotide positions. The reading frames of the polynucleotide segments are shown, and the 
polypeptides encoded by the polynucleotide segments constitute either signal peptide (SP) or 
transmembrane (TM) domains, as indicated. 

Table 4 shows the sequence identification numbers (SEQ ID NO:s) and template identification 
numbers (template IDs) corresponding to the polynucleotides of the present invention, along with 
15 component sequence identification numbers (component IDs) corresponding to each template. The 

component sequences, which were used to assemble the template sequences, are defined by the indicated 
"start" and "stop" nucleotide positions along each template. 

Table 5 shows the tissue distribution profiles for the templates of the invention. 
Table 6 summarizes the bioinformatics tools which are useful for analysis of the 
20 polynucleotides of the present invention. The first column of Table 6 lists analytical tools, programs, 
and algorithms, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
2 5 two sequences). 

DETAILED DESCRIPTION OF THE INVENTION 

Before the nucleic acid sequences and methods are presented, it is to be understood that this 
invention is not limited to the particular machines, methods, and materials described. Although 
30 particular embodiments are described, machines, methods, and materials similar or equivalent to these 
embodiments may be used to practice the invention. The preferred machines, methods, and materials 
set forth are not intended to limit the scope of the invention which is limited only by the appended 
claims. 

The singular forms "a", "an", and "the" include plural reference unless the context clearly 
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dictates otherwise. All technical and scientific terms have the meanings commonly understood by one 
of ordinary skill in the art. All publications are incorporated by reference for the purpose of describing 
and disclosing the cell lines, vectors, and methodologies which are presented and which might be used in 
connection with the invention. Nothing in the specification is to be construed as an admission that the 
5 invention is not entitled to antedate such disclosure by virtue of prior invention. 

Definitions 

As used herein, the lower case "dithp" refers to a nucleic acid sequence, while the upper case 
"DITHP" refers to an amino acid sequence encoded by dithp. A "full-length" dithp refers to a nucleic 

1 o acid sequence containing the entire coding region of a gene endogenously expressed in human tissue. 

"Adjuvants" are materials such as Freund's adjuvant, mineral gels (aluminum hydroxide), and 
surface active substances (lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, and dinitrophenol) which may be administered to increase a host's immunological 
response. 

15 "Allele" refers to an alternative form of a nucleic acid sequence. Alleles result from a 

"mutation," a change or an alternative reading of the genetic code. Any given gene may have none, one, 
or many allelic forms. Mutations which give rise to alleles include deletions, additions, or substitutions 
of nucleotides. Each of these changes may occur alone, or in combination with the others, one or more 
times in a given nucleic acid sequence. The present invention encompasses allelic dithp. 

20 "Amino acid sequence" refers to a peptide, a polypeptide, or a protein of either natural or 

synthetic origin. The amino acid sequence is not limited to the complete, endogenous amino acid 
sequence and may be a fragment, epitope, variant, or derivative of a protein expressed by a nucleic acid 
sequence. 

"Amplification" refers to the production of additional copies of a sequence and is carried out 

2 5 using polymerase chain reaction (PCR) technologies well known in the art. 

"Antibody" refers to intact molecules as well as to fragments thereof, such as Fab, F(ab*>2» and 
Fv fragments, which are capable of binding the epitopic determinant. Antibodies that bind DITHP 
polypeptides can be prepared using intact polypeptides or using fragments containing small peptides of 
interest as the immunizing antigen. The polypeptide or peptide used to immunize an animal (e.g., a 

3 o mouse, a rat, or a rabbit) can be derived from the translation of RN A, or synthesized chemically, and 

can be conjugated to a carrier protein if desired. Commonly used carriers that are chemically coupled 
to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet hemocyanin (KLH). The 
coupled peptide is then used to immunize the animal. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
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sequence. The antisense sequence may include DNA, RNA, or any nucleic acid mimic or analog such 
as peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2 -methoxyethyl sugars or 2 -methoxyethoxy sugars; or oligonucleotides having 
5 modified bases such as 5-methyl cytosine, 2 -deoxyuracil, or 7-deaza-2 -deoxyguanosine. 

"Antisense sequence" refers to a sequence capable of specifically hybridizing to a target 
sequence. The antisense sequence can be DN A, RNA, or any nucleic acid mimic or analog. 

"Antisense technology" refers to any technology which relies on the specific hybridization of an 
antisense sequence to a target sequence. 
10 A "bin" is a portion of computer memory space used by a computer program for storage of 

data, and bounded in such a manner that data stored in a bin may be retrieved by the program. 

"Biologically active" refers to an amino acid sequence having a structural, regulatory, or 
biochemical function of a naturally occurring amino acid sequence. 

"Clone joining" is a process for combining gene bins based upon the bins' containing sequence 
1 5 information from the same clone. The sequences may assemble into a primary gene transcript as well 
as one or more splice variants. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing (5'-A-G-T-3' pairs with its complement 3*-T-C-A-5*). 

A "component sequence" is a nucleic acid sequence selected by a computer program such as 
2 o PHRED and used to assemble a consensus or template sequence from one or more component 
sequences. 

A "consensus sequence" or "template sequence" is a nucleic acid sequence which has been 
assembled from overlapping sequences, using a compute program for fragment assembly such as the 
GEL VIEW fragment assembly system (Genetics Computer Group (GCG), Madison WI) or using a 
2 5 relational database management system (RDMS). 

"Conservative amino acid substitutions" are those substitutions that, when made, least interfere 
with the properties of the original protein, i.e., the structure and especially the function of the protein is 
conserved and not significantly changed by such substitutions. The table below shows amino acids 
which may be substituted for an original amino acid in a protein and which are regarded as conservative 
30 substitutions. 

Original Residue Conservative Substitution 

Ala Gly, Ser 

Arg His, Lys 

35 Asn Asp, Gin, His 
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A<;n Oln 

iUllj will 






Ala Qpr 

./Aid., OC1 




VJ1I1 


A^ti frlii Hie 






Acn fll r\ Hfic 
iVOj^J, VJlll, ilia 




oiy 


Ala 




rxio 


Acn Arc* din frlii 




lie 


T pii Val 

JLrCU , V ol 






Tip Val 

IXC, V dl 




Lys 


Arg, Gin, Glu 


10 


Met 


Leu, lie 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Trp 


Phe, Tyr 


15 


Tyr 


His, Phe, Trp 




Val 


lie, Leu, Thr 



Conservative substitutions generally maintain (a) the structure of the polypeptide backbone in 
20 the area of the substitution, for example, as a beta sheet or alpha helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. 

"Deletion" refers to a change in either a nucleic or amino acid sequence in which at least one 
nucleotide or amino acid residue, respectively, is absent. 

"Derivative" refers to the chemical modification of a nucleic acid sequence, such as by 
25 replacement of hydrogen by an alkyl, acyl, amino, hydroxyl, or other group. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

"E- value" refers to the statistical probability that a match between two sequences occurred by 

chance. 

3 0 A "fragment" is a unique portion of dithp or DITHP which is identical in sequence to but 

shorter in length than the parent sequence. A fragment may comprise up to the entire length of the 
defined sequence, minus one nucleotide/amino acid residue. For example, a fragment may comprise 
from 10 to 1000 contiguous amino acid residues or nucleotides. A fragment used as a probe, primer, 
antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 

35 60, 75, 100, 150, 250 or at least 500 contiguous amino acid residues or nucleotides in length. 
Fragments may be preferentially selected from certain regions of a molecule. For example, a 
polypeptide fragment may comprise a certain length of contiguous amino acids selected from the first 
250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain defined sequence. 
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Clearly these lengths are exemplary, and any length that is supported by the specification, including the 
Sequence Listing and the figures, may be encompassed by the present embodiments. 

A fragment of dithp comprises a region of unique polynucleotide sequence that specifically 
identifies dithp, for example, as distinct from any other sequence in the same genome. A fragment of 
5 dithp is useful, for example, in hybridization and amplification technologies and in analogous methods 
that distinguish dithp from related polynucleotide sequences. The precise length of a fragment of dithp 
and the region of dithp to which the fragment corresponds are routinely determinable by one of ordinary 
skill in the art based on the intended purpose for the fragment. 

A fragment of DITHP is encoded by a fragment of dithp. A fragment of DITHP comprises a 
10 region of unique amino acid sequence that specifically identifies DITHP. For example, a fragment of 
DITHP is useful as an immunogenic peptide for the development of antibodies that specifically 
recognize DITHP. The precise length of a fragment of DITHP and the region of DITHP to which the 
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

15 A "full length" nucleotide sequence is one containing at least a start site for translation to a 

protein sequence, followed by an open reading frame and a stop site, and encoding a "full length" 
polypeptide. 

"Hit" refers to a sequence whose annotation will be used to describe a given template. Criteria 
for selecting the top hit are as follows: if the template has one or more exact nucleic acid matches, the 

2 o top hit is the exact match with highest percent identity. If the template has no exact matches but has 

significant protein hits, the top hit is the protein hit with the lowest E-value. If the template has no 
significant protein hits, but does have significant non-exact nucleotide hits, the top hit is the nucleotide 
hit with the lowest E-value. 

"Homology" refers to sequence similarity either between a reference nucleic acid sequence and 
25 at least a fragment of a dithp or between a reference amino acid sequence and a fragment of a DITHP. 

"Hybridization" refers to the process by which a strand of nucleotides anneals with a 
complementary strand through base pairing. Specific hybridization is an indication that two nucleic 
acid sequences share a high degree of identity. Specific hybridization complexes form under defined 
annealing conditions, and remain hybridized after the "washing" step. The defined hybridization 

3 o conditions include the annealing conditions and the washing step(s), the latter of which is particularly 

important in determining the stringency of the hybridization process, with more stringent conditions 
allowing less non-specific binding, i.e., binding between pairs of nucleic acid probes that are not 
perfectly matched. Permissive conditions for annealing of nucleic acid sequences are routinely 
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determinable and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency. 

Generally, stringency of hybridization is expressed with reference to the temperature under 
which the wash step is carried out Generally, such wash temperatures are selected to be about 5°C to 
5 20°C lower than the thermal melting point (T^ for the specific sequence at a defined ionic strength and 
pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the target 
sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions for 
nucleic acid hybridization is well known and can be found in Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 

10 see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68 °C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, or 55°C may be used SSC concentration may be 
varied from about 0.2 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking reagents 

15 are used to block non-specific hybridization. Such blocking reagents include, for instance, denatured 
salmon sperm DNA at about 100-200 |ig/ml. Useful variations on these conditions will be readily 
apparent to those skilled in the art. Hybridization, particularly under high stringency conditions, may 
be suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative 
of a similar role for the nucleotides and their resultant proteins. 

20 Other parameters, such as temperature, salt concentration, and detergent concentration may be 

varied to achieve the desired stringency. Denaturants, such as formamide at a concentration of about 
35-50% v/v, may also be used under particular circumstances, such as RNA:DNA hybridizations. 
Appropriate hybridization conditions are routinely determinable by one of ordinary skill in the art. 

"Immunogenic" describes the potential for a natural, recombinant, or synthetic peptide, epitope, 

25 polypeptide, or protein to induce antibody production in appropriate animals, cells, or cell lines. 

"Insertion" or "addition" refers to a change in either a nucleic or amino acid sequence in which 
at least one nucleotide or residue, respectively, is added to the sequence. 

"Labeling" refers to the covalent or noncovalent joining of a polynucleotide, polypeptide, or 
antibody with a reporter molecule capable of producing a detectable or measurable signal. 

30 "Microarray" is any arrangement of nucleic acids, amino acids, antibodies, etc., on a substrate. 

The substrate may be a solid support such as beads, glass, paper, nitrocellulose, nylon, or an 
appropriate membrane. 

linkers" are short stretches of nucleotide sequence which may be added to a vector or a dithp 
to create restriction endonuclease sites to facilitate cloning. "Polylinkers" are engineered to incorporate 
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multiple restriction enzyme sites and to provide for the use of enzymes which leave 5' or 3' overhangs 
(e.g., BamHI, EcoRI, and HindUl) and those which provide blunt ends (e.g., EcoRV, SnaBI, and StuI). 

"Naturally occurring" refers to an endogenous polynucleotide or polypeptide that may be 
isolated from viruses or prokaryotic or eukaryotic cells. 
5 "Nucleic acid sequence" refers to the specific order of nucleotides joined by phosphodiester 

bonds in a linear, polymeric arrangement. Depending on the number of nucleotides, the nucleic acid 
sequence can be considered an oligomer, oligonucleotide, or polynucleotide. The nucleic acid can be 
DNA, RNA, or any nucleic acid analog, such as PNA, may be of genomic or synthetic origin, may be 
either double-stranded or single-stranded, and can represent either the sense or antisense 

10 (complementary) strand. 

"Oligomer" refers to a nucleic acid sequence of at least about 6 nucleotides and as many as 
about 60 nucleotides, preferably about 15 to 40 nucleotides, and most preferably between about 20 and 
30 nucleotides, that may be used in hybridization or amplification technologies. Oligomers may be used 
as, e.g., primers for PCR, and are usually chemically synthesized. 

1 5 "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

2 o "Peptide nucleic acid" (PNA) refers to a DNA mimic in which nucleotide bases are attached to 

a pseudopeptide backbone to increase stability. PNAs, also designated antigene agents, can prevent 
gene expression by targeting complementary messenger RNA. 

The phrases "percent identity" and "% identity", as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
25 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 1 2e sequence 

3 o alignment program. This program is part of the L ASERGENE software package, a suite of molecular 

biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and Sharp, P.M. (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
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sheeted as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequence pairs. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
5 Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to determine alignment between a known polynucleotide 
sequence and other sequences on a variety of databases. Also available is a tool called "BLAST 2 
10 Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at ht^)^/www.ncbi.nlm.nih.gov/gQrf/bl2/. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
is 2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 
2 0 Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 

2 5 defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 

the length of a fragment taken from a largo:, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in figures or Sequence Listings, may be used to describe a length over which percentage 

3 o identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 
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The phrases "percent identity" and "% identity", as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
5 explained in more detail above, generally preserve the hydrophobicity and acidity of the substituted 
residue, thus preserving the structure (and therefore function) of the folded polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
10 CLUSTAL V, the default parameters are set as follows: Ktuple^l, gap penalty=3, window=5, and 
"diagonals saved' =5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity* 
between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
15 comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9 
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalty 
Gap x drop-off: SO 
20 Expect: 10 

Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 

2 5 example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 

a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in figures or Sequence Listings, may be used to describe a 
length over which percentage identity may be measured. 

3 o "Post-translational modification" of a DITHP may involve lipidation, glycosylation, 

phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu and the DITHP. 
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"Probe" refers to dithp or fragments thereof, which are used to detect identical, allelic or related 
nucleic acid sequences. Probes are isolated oligonucleotides or polynucleotides attached to a detectable 
label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent 
agents, and enzymes. "Primers" are short nucleic acids, usually DNA oligonucleotides, which may be 
5 annealed to a target polynucleotide by complementary base-pairing. The primer may then be extended 
along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification 
(and identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
10 be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 90, 100, or 
at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the figures and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
15 example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual , 2 nd ed, vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel et al.,1987, Current Protocols in Molecular Biology . 
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide 
to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs can be derived from 
a known sequence, for example, by using computer programs intended for that purpose such as Primer 
2 o (Version 0.5, 1 991 , Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar prima- selection 

2 5 programs have incorporated additional features for expanded capabilities. For example* the PrimOU 

primer selection program (available to the public from the Genome Cento at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 

3 o Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 

binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
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UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 

hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 

Hence, this program is useful for identification of both unique and conserved oligonucleotides and 

polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
5 above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 

primers, microarray elements, or specific probes to identify fully or partially complementary 

polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 

those described above. 

"Purified" refers to molecules, either polynucleotides or polypeptides that are isolated or 
10 separated from their natural environment and are at least 60% free, preferably at least 75% free, and 

most preferably at least 90% free from other compounds with which they are naturally associated. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

that is made by an artificial combination of two or more otherwise separated segments of sequence. 

This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
15 artificial manipulation of isolated segments of nucleic acids, e.g„ by genetic engineering techniques 

such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 

been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 

recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 

Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 
20 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 

expressed, inducing a protective immunological response in the mammal. 

"Regulatory element" refers to a nucleic acid sequence from nontranslated regions of a gene, 

and includes enhancers, promoters, introns, and 3* untranslated regions, which interact with host 

2 5 proteins to carry out or regulate transcription or translation. 

"Reporter" molecules are chemical or biochemical moieties used for labeling a nucleic acid, an 
amino acid, or an antibody. They include radionuclides; enzymes; fluorescent, chemiluminescent, or 
chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and other moieties known in 
the art. 

3 o An "RNA equivalent," in reference to a DN A sequence, is composed of the same linear 

sequence of nucleotides as the reference DN A sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 



123 



WO 01/21836 



PCT/US00/25643 



"Sample" is used in its broadest sense. Samples may contain nucleic or amino acids, 
antibodies, or other materials, and may be derived from any source (e.g., bodily fluids including, but not 
limited to, saliva, blood, and urine; chromosome(s), organelles, or membranes isolated from a cell; 
genomic DNA, RNA, or cDNA in solution or bound to a substrate; and cleared cells or tissues or blots 
5 or imprints from such cells or tissues). 

"Specific binding" or "specifically binding" refers to the interaction between a protein or 
peptide and its agonist, antibody, antagonist, or other binding partner. The interaction is dependent 
upon the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
10 presence of a polypeptide containing epitope A, or the presence of free unlabeled A, in a reaction 
containing free labeled A and the antibody will reduce the amount of labeled A that binds to the 
antibody. 

"Substitution" refers to the replacement of at least one nucleotide or amino acid by a different 
nucleotide or amino acid. 

15 "Substrate" refers to any suitable rigid or semi-rigid support including, e.g., membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles or capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective patten of gene expression by a particular tissue or 

2 o cell type under given conditions at a given time. 

"Transformation" refers to a process by which exogenous DNA enters a recipient cell. 
Transformation may occur under natural or artificial conditions using various methods well known in 
the art. Transformation may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being 
25 transformed. 

"Transformants" include stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well as 
cells which transiently express inserted DNA or RNA. 

A "transgenic organism," as used herein* is any organism, including but not limited to animals 

3 o and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 

introduced by way of human intervention, such as by transgenic techniques well known in the art. The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant 
virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, 
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but rather is directed to the introduction of a recombinant DN A molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, and plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 
known in the art, for example infection, transfection, transformation or transconjugatioiL Techniques 
5 for transferring the DNA of the present invention into such organisms are widely known and provided in 
references such as Sambrook et aL (1989), supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 25% sequence identity to the particular nucleic acid sequence ova* a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 

10 set at default parameters. Such a pair of nucleic acids may show, for example, at least 30%, at least 
50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or even at least 98% or 
greater sequence identity over a certain defined length. The variant may result in "conservative" amino 
acid changes which do not affect structural and/or chemical properties. A variant may be described as, 
for example, an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice 

1 5 variant may have significant identity to a reference molecule, but will generally have a greater or lesser 
number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional functional domains or lack domains that are present 
in the reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant amino acid identity relative to each 

2 o other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 

individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence of 
SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for a 
disease state. 

25 In an alternati ve, variants of the polynucleotides of the present invention may be generated 

through recombinant methods. One possible method is a DNA shuffling technique such as 
MOLECUL ARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A- et al. (1996) Nat Biotechnol. 14:315-319) to alter or improve 

3 o the biological properties of DITHP, such as its biological or enzymatic activity or its ability to bind to 

other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
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selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
5 family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
10 1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

THE INVENTION 

is In a particular embodiment, cDNA sequences derived from human tissues and cell lines were 

aligned based on nucleotide sequence identity and assembled into "consensus" or template" sequences 
which are designated by the template identification numbers (template IDs) in column 2 of Table 1 . 
The sequence identification numbers (SEQ ID NO:s) corresponding to the template IDs are shown in 
column 1 . The template sequences have similarity to GenBank sequences, or "hits," as designated by 

20 the GI Numbers in column 3. The statistical probability of each GenBank hit is indicated by a 

probability score in column 4, and the functional annotation corresponding to each GenBank hit is listed 
in column 5. 

The invention incorporates the nucleic acid sequences of these templates as disclosed in the 
Sequence Listing and the use of these sequences in the diagnosis and treatment of disease states 
2 5 characterized by defects in human molecules. The invention further utilizes these sequences in 
hybridization and amplification technologies, and in particular, in technologies which assess gene 
expression patterns correlated with specific cells or tissues and their responses in vivo or in vitro to 
pharmaceutical agents, toxins, and other treatments. In this manner, the sequences of the present 
invention are used to develop a transcript image for a particular cell or tissue. 



30 



Derivation of Nucleic Acid Sequences 

cDN A was isolated from libraries constructed using RNA derived from normal and diseased 
human tissues and cell lines. The human tissues and cell lines used for cDNA library construction were 
selected from a broad range of sources to provide a diverse population of cDNAs representative of gene 
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transcription throughout the human body. Descriptions of the human tissues and cell lines used for 
cDNA library construction are provided in the LEFESEQ database (Incyte Genomics, Inc. (Incyte), Palo 
Alto C A). Human tissues were broadly selected from, for example, cardiovascular, dermatologic, 
endocrine, gastrointestinal, hematopoietic/immune system, musculoskeletal, neural, reproductive, and 
5 urologic sources. 

Cell lines used for cDNA library construction were derived from, for example, leukemic cells, 
teratocarcinomas, neuroepitheliomas, cervical carcinoma, lung fibroblasts, and endothelial cells. Such 
cell lines include, for example, THP-1, Jurkat, HUVEC, hNT2, WI38, HeLa, and other cell lines 
commonly used and available from public depositories (American Type Culture Collection, Manassas 
10 VA). Prior to mRNA isolation, cell lines were untreated, treated with a pharmaceutical agent such as 
5 -aza-2 -deoxycytidine, treated with an activating agent such as lipopolysaccharide in the case of 
leukocytic cell lines, or, in the case of endothelial cell lines, subjected to shear stress. 

Sequencing of the cDNAs 

1 5 Methods for DN A sequencing are well known in the art. Conventional enzymatic methods 

employ the Klenow fragment of DNA polymerase I, SEQUENASE DNA polymerase (U.S. 
Biochemical Corporation, Cleveland OH), Taq polymerase (PE Biosystems, Foster City CA), 
thermostable T7 polymerase (Amersham Pharmacia Biotech, Inc. (Amersham Pharmacia Biotech), 
Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found in 

20 the ELONGASE amplification system (Life Technologies Inc. (Life Technologies), Gaithersburg MD), 
to extend the nucleic acid sequence from an oligonucleotide primer annealed to the DNA template of 
interest. Methods have been developed for the use of both single-stranded and double-stranded 
templates. Chain termination reaction products may be electrophoresed on urea-polyacrylamide gels 
and detected either by autoradiography (for radioisotope-labeled nucleotides) or by fluorescence (for 

25 fluorophore-labeled nucleotides). Automated methods for mechanized reaction preparation, sequencing, 
and analysis using fluorescence detection methods have been developed. Machines used to prepare 
cDNAs for sequencing can include the MICROLAB 2200 liquid transfer system (Hamilton Company 
(Hamilton), Reno NV), Peltier thermal cycler (PTC200; MJ Research, Inc. (MJ Research), Watertown 
MA), and ABI CATALYST 800 thermal cycler (PE Biosystems). Sequencing can be carried out using, 

3 0 for example, the ABI 373 or 377 (PE Biosystems) or MEGAB ACE 1000 (Molecular Dynamics, Inc. 
(Molecular Dynamics), Sunnyvale CA) DNA sequencing systems, or other automated and manual 
sequencing systems well known in the art. 

The nucleotide sequences of the Sequence Listing have been prepared by current, state-of-the- 
art, automated methods and, as such, may contain occasional sequencing errors or unidentified 
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nucleotides. Such unidentified nucleotides are designated by an N. These infrequent unidentified bases 
do not represent a hindrance to practicing the invention for those skilled in the art. Several methods 
employing standard recombinant techniques may be used to correct errors and complete the missing 
sequence information. (See, e.g., those described in Ausubel, F.M. et aL (1997) Short Protocols in 
5 Molecular Biology . John Wiley & Sons, New York NY; and Sambrook, J. et al. (1989) Molecular 
Cloning. A Laboratory Manual . Cold Spring Harbor Press, Plainview NY.) 

Assembly of cDNA Sequences 

Human polynucleotide sequences may be assembled using programs or algorithms well known 

10 in the art. Sequences to be assembled are related, wholly or in part, and may be derived from a single 
or many different transcripts. Assembly of the sequences can be performed using such programs as 
PHRAP (Phils Revised Assembly Program) and the GEL VIEW fragment assembly system (GCG), or 
other methods known in the art. 

Alternatively, cDNA sequences are used as "component" sequences that are assembled into 

15 "template" or "consensus" sequences as follows. Sequence chromatograms are processed, verified, and 
quality scores are obtained using PHRED. Raw sequences are edited using an editing pathway known 
as Block 1 (See, e.g., the LIFESEQ Assembled User Guide, Incyte Genomics, Palo Alto, CA). A series 
of BLAST comparisons is performed and low-information segments and repetitive elements (e.g., 
dinucleotide repeats, Alu repeats, etc.) are replaced by "n*s", or masked, to prevent spurious matches. 

2 o Mitochondrial and ribosomal RN A sequences are also removed. The processed sequences are then 
loaded into a relational database management system (RDMS) which assigns edited sequences to 
existing templates, if available. When additional sequences are added into the RDMS, a process is 
initiated which modifies existing templates or creates new templates from works in progress (i.e., 
nonfinal assembled sequences) containing queued sequences or the sequences themselves. After the new 

2 5 sequences have been assigned to templates, the templates can be merged into bins. If multiple templates 

exist in one bin, the bin can be split and the templates reannotated. 

Once gene bins have been generated based upon sequence alignments, bins are "clone joined" 
based upon clone information. Clone joining occurs when the 5' sequence of one clone is present in one 
bin and the 3' sequence from the same clone is present in a different bin, indicating that the two bins 

3 o should be merged into a single bin. Only bins which share at least two different clones are merged. 

A resultant template sequence may contain either a partial or a full length open reading frame, 
or all or part of a genetic regulatory element. This variation is due in part to the fact that the full length 
cDNAs of many genes are several hundred, and sometimes several thousand, bases in length. With 
current technology, cDNAs comprising the coding regions of large genes cannot be cloned because of 
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vector limitations, incomplete reverse transcription of the mRNA, or incomplete "second strand" 
synthesis. Template sequences may be extended to include additional contiguous sequences derived 
from the parent RN A transcript using a variety of methods known to those of skill in the art Extension 
may thus be used to achieve the full length coding sequence of a gene. 

5 

Analysis of the cDNA Sequences 

The cDNA sequences are analyzed using a variety of programs and algorithms which are well 

known in the art. (See, e.g., Ausubel, 1997, supra . Chapter 7.7; Meyers, R. A. (Ed.) (1995) Molecular 

Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853; and Table 6.) These analyses 
10 comprise both reading frame determinations, e.g., based on triplet codon periodicity for particular 

organisms (Fickett, J.W. (1982) Nucleic Acids Res. 10:5303-5318); analyses of potential start and stop 

codons; and homology searches. 

Computer programs known to those of skill in the art for performing computer-assisted 

searches for amino acid and nucleic acid sequence similarity, include, for example, Basic Local 
15 Alignment Search Tool (BLAST; Altschul, S.F. (1993) J. Mol. Evol. 36:290-300; Altschul, S.F. et al. 

(1990) J. Mol. Biol. 215:403-410). BLAST is especially useful in determining exact matches and 

comparing two sequence fragments of arbitrary but equal lengths, whose alignment is locally maximal 

and for which the alignment score meets or exceeds a threshold or cutoff score set by the user (Karlin, 

S. et al. (1988) Proc. Natl. Acad, Sci. USA 85:841-845). Using an appropriate search tool (e.g., 
20 BLAST or HMM), GenBank, SwissProt, BLOCKS, PFAM and other databases may be searched for 

sequences containing regions of homology to a query dithp or DITHP of the present invention. 

Other approaches to the identification, assembly, storage, and display of nucleotide and 

polypeptide sequences are provided in "Relational Database for Storing Biomolecule Information," 

U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
2 5 Database," U.S.S.N. 08/81 1 ,758, filed March 6, 1997; and "Relational Database and System for 

Storing Information Relating to Biomolecular Sequences," U.S.S.N. 09/034,807, filed March 4, 1998, 

all of which are incorporated by reference herein in their entirety. 

Protein hierarchies can be assigned to the putative encoded polypeptide based on, e.g., motif, 

BLAST, or biological analysis. Methods for assigning these hierarchies are described, for example, in 
30 "Database System Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," 

U.S.S.N. 08/812,290, filed March 6, 1997, incorporated herein by reference. 

Identification of Human Diagnostic and Therapeutic Molecules Encoded by dithp 
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The identities of the DITHP encoded by the dithp of the present invention were obtained by 
analysis of the assembled cDNA sequences. SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8 encode, for example, human 
enzyme molecules. SEQ ID NO:9 encodes, for example, an extracellular information transmission 
,» 5 molecule. SEQ ID NO: 10 and SEQ ID NO:l 1 encode, for example, receptor molecules. SEQ ID 

NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, and SEQ 
ID NO: 18 encode, for example, intracellular signaling molecules. SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, 
SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, 

10 and SEQ ID NO:33 encode, for example, transcription factor molecules. SEQ ID NO:34 encodes, for 
example, a protein modification and maintenance molecule. SEQ ID NO:35 and SEQ ID NO:36 
encode, for example, nucleic acid synthesis and modification molecules. SEQ ID NO:37 encodes, for 
example, an antigen recognition molecule. SEQ ID NO:38 and SEQ ID NO:39 encode, for example, 
secreted/extracellular matrix molecules. SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 

15 NO:43, SEQ ID NO:44, and SEQ ID NO:45 encode, for example, cytoskeletal molecules. SEQ ID 
NO:46, SEQ ID NO:47, and SEQ ID NO:48 encode, for example, cell membrane molecules. SEQ ID 
NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, and SEQ ID NO:53 encode, for example, 
ribosomal molecules. SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID 
NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, and SEQ ID NO:63 

20 encode, for example, organelle associated molecules. SEQ ID NO:64, SEQ ID NO:65, SEQ ID 

NO:66, SEQ ID NO:67, and SEQ ID NO:68 encode, for example, biochemical pathway molecules. 
SEQ ID NO:69, SEQ ID NO:70, and SEQ ID NO:71 encode, for example, molecules associated with 
growth and development. 



25 Sequences of Human Diagnostic and Therapeutic Molecules 

The dithp of the present invention may be used for a variety of diagnostic and therapeutic 
purposes. For example, a dithp may be used to diagnose a particular condition, disease, or disorder 
associated with human molecules. Such conditions, diseases, and disorders include, but are not limited 
to, a cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 

30 cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall . 
bladder, ganglia, gastrointestinal tract, heart, kidney, fiver, lung, muscle, ovary, pancreas, parathyroid, 
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penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 
autoimmune/inflammatory disorder, such as inflammation, actinic keratosis, acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
5 autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, 
Goodpasture's syndrome, gout, Graves* disease, Hashimoto's thyroiditis, paroxysmal nocturnal 
hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with 

10 lymphocytotoxins, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, 
polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 

15 complications of cancer, hemodialysis, and extracorporeal circulation, trauma, and hematopoietic 
cancer including lymphoma, leukemia, and myeloma; an infection caused by a viral agent classified 
as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus, filovirus, hepadnavirus, herpesvirus, 
flavivirus, orthomyxovirus, parvovirus, papovavirus, paramyxovirus, picornavirus, poxvirus, reovirus, 
retrovirus, rhabdovirus, or togavirus; an infection caused by a bacterial agent classified as 

20 pneumococcus, staphylococcus, streptococcus, bacillus, corynebacterium, Clostridium, 

meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus, legionella, bordetella, gram- 
negative enterobacterium including shigella, salmonella, or Campylobacter, pseudomonas, vibrio, 
brucella, francisella, yersinia, bartonella, norcardium, actinomyces, mycobacterium, spirochaetale, 
rickettsia, chlamydia, or mycoplasma; an infection caused by a fungal agent classified as aspergillus, 

25 blastomyces, dermatophytes, cryptococcus, coccidioides, malasezzia, histoplasma, or other mycosis- 
causing fungal agent; and an infection caused by a parasite classified as Plasmodium or malaria- 
causing, parasitic entamoeba, leishmania, trypanosoma, toxoplasma, Pneumocystis carinii, intestinal 
protozoa such as giardia, trichomonas, tissue nematode such as trichinella, intestinal nematode such 
as ascaris, lymphatic filarial nematode, trematode such as schistosoma, and cestrode such as 

3 o tapeworm; a developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, 
achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, 
WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), 
Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary 
keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 

3 5 hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, 
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spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing 
loss; an endocrine disorder such as a disorder of the hypothalamus and/or pituitary resulting from 
lesions such as a primary brain tumor, adenoma, infarction associated with pregnancy, 
hypophysectomy, aneurysm, vascular malformation, thrombosis, infection, immunological disorder, and 
t \ 5 complication due to head trauma; a disorder associated with hypopituitarism including hypogonadism, 

Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian disease, Letterer- 
Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism; a disorder associated with 
hypopituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone 
(ADH) secretion (SIADH) often caused by benign adenoma; a disorder associated with hypothyroidism 

10 including goiter, myxedema, acute thyroiditis associated with bacterial infection, subacute thyroiditis 
associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and cretinism; a disorder 
associated with hypothyroidism including thyrotoxicosis and its various forms, Grave's disease, 
pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's disease; a disorder 
associated with hyperparathyroidism including Conn disease (chronic hypercalemia); a pancreatic 

1 5 disorder such as Type I or Type II diabetes mellitus and associated complications; a disorder associated 
with the adrenals such as hypoplasia, carcinoma, or adenoma of the adrenal cortex, hypotension 
associated with alkalosis, amyloidosis, hypokalemia, Cushing's disease, Lidcfle's syndrome, and 
Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and Addison's disease; a disordo 
associated with gonadal steroid hormones such as: in women, abnormal prolactin production, 

2 o infertility, endometriosis, poturbation of the menstrual cycle, polycystic ovarian disease, 

hypoprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, hermaphroditism, 
hirsutism and virilization, breast cancer, and, in post-menopausal women, osteoporosis; and, in men, 
Leydig cell deficiency, male climacteric phase, and gominal cell aplasia, a hypergonadal disorder 
associated with Leydig cell tumors, androgen resistance associated with absence of androgen receptors, 

25 syndrome of 5 a-reductase, and gynecomastia; a metabolic disorder such as Addison's disease, 

cerebrotendinous xanthomatosis, congenital adrenal hyperplasia, coumarin resistance, cystic fibrosis, 
diabetes, fatty hepatocirrhosis, fructose- 1,6-diphosphatase deficiency, galactosemia, goiter, 
glucagonoma, glycogen storage diseases, hereditary fructose intolerance, hyperadrenalism, 
hypoadrenalism, hyperparathyroidism, hypoparathyroidism, hypercholesterolemia, hyperthyroidism, 

30 hypoglycemia, hypothyroidism, hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, 
lysosomal storage diseases, mannosidosis, neuraminidase deficiency, obesity, pentosuria 
phenylketonuria, pseudovitamin D-deficiency rickets; disorders of carbohydrate metabolism such as 
congenital type II dyserythropoietic anemia, diabetes, insulin-dependent diabetes mellitus, 
non-insulhvdependent diabetes mellitus, fructose- 1,6-diphosphatase deficiency, galactosemia, 
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glucagonoma, hereditary fructose intolerance, hypoglycemia, mannosidosis, neuraminidase 
deficiency, obesity, galactose epimerase deficiency, glycogen storage diseases, lysosomal storage 
diseases, fructosuria, pentosuria, and inherited abnormalities of pyruvate metabolism; disorders of 
lipid metabolism such as fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, 
k) 5 carnitine palmitoyltransferase deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, 

lipid storage disorders such Fabry's disease, Gaucher's disease, Niemann-Pick's disease, 
metachromatic leukodystrophy, adrenoleukodystrophy, GMj gangliosidosis, and ceroid 
lipofuscinosis, abetalipoproteinemia, Tangier disease, hyperlipoproteinemia, diabetes mellitus, 
lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, adiposis dolorosa, lipoid 

10 adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, hypercholesterolemia, 

hypercholesterolemia with hypertriglyceridemia, primary hypoalphalipoproteinemia, hypothyroidism, 
renal disease, fiver disease, lecithinrcholesterol acyltransferase deficiency, cerebrotendinous 
xanthomatosis, sitosterolemia, hypercholesterolemia, Tay-Sachs disease, SandhofFs disease, 
hyperlipidemia, hyperlipemia, lipid myopathies, and obesity; and disorders of copper metabolism 

is such as Menke's disease, Wilson's disease, and Ehlers-Danlos syndrome type IX; a neurological 
disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, 

20 bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative 
intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion 
diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, 
fatal familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental 

2 5 retardation and other developmental disorder of the central nervous system, cerebral palsy, a 

neuroskeletal disorder, an autonomic nervous system disorder, a cranial nerve disorder, a spinal cord 
disease, muscular dystrophy and other neuromuscular disorder, a peripheral nervous system disorder, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathy, myasthenia 
gravis, periodic paralysis, a mental disorder including mood, anxiety, and schizophrenic disorders, 

30 seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
v dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a 

gastrointestinal disorder including ulcerative colitis, gastric and duodenal ulcers, cystinuria, 
dibasicaminoaciduria, hypercystinuria, lysinuria, hartnup disease, tryptophan malabsorption, 
methionine malabsorption, histidinuiia, iminoglycinuria, dicarboxyUcaminoaciduria, cystinosis, renal 
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glycosuria, hypouricemia, familial hypophophatemic rickets, congenital chloridorrhea, distal renal 
tubular acidosis, Menkes' disease, Wilson's disease, lethal diarrhea, juvenile pernicious anemia, 
folate malabsorption, adrenoleukodystrophy, hereditary myoglobinuria, and Zellweger syndrome; a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 
normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., 
angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 
cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
infectious myositis, and polymyositis, neurological disorders associated with transport, e.g., 
Alzheimer's disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, 
paranoid psychoses, and schizophrenia, and other disorders associated with transport, e.g., 
neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, 
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, Cushing's disease, and Addison's disease; and a connective tissue disorder 
such as osteogenesis imperfecta, Ehlers-Danlos syndrome, chondrodysplasias, Marfan syndrome, 
Alport syndrome, familial aortic aneurysm, achondroplasia, mucopolysaccharidoses, osteoporosis, 
osteopetrosis, Paget's disease, rickets, osteomalacia, hyperparathyroidism, renal osteodystrophy, 
osteonecrosis, osteomyelitis, osteoma, osteoid osteoma, osteoblastoma, osteosarcoma, 
osteochondroma, chondroma, chondroblastoma, chondromyxoid fibroma, chondrosarcoma, fibrous 
cortical defect, nonossifying fibroma, fibrous dysplasia, fibrosarcoma, malignant fibrous 
histiocytoma, Ewing's sarcoma, primitive neuroectodermal tumor, giant cell tumor, osteoarthritis, 
rheumatoid arthritis, ankylosing spondylarthritis, Reiter's syndrome, psoriatic arthritis, enteropathic 
arthritis, infectious arthritis, gout, gouty arthritis, calcium pyrophosphate crystal deposition disease, 
ganglion, synovial cyst, villonodular synovitis, systemic sclerosis, Dupuytren's contracture, hepatic 
fibrosis, lupus erythematosus, mixed connective tissue disease, epidermolysis bullosa simplex, bullous 
congenital ichthyosiform erythroderma (epidermolytic hyperkeratosis), non-epidermolytic and 
epidermolytic palmoplantar keratoderma, ichthyosis bullosa of Siemens, pachyonychia congenita, and 
white sponge nevus. The dithp can be used to detect the presence of, or to quantify the amount of, a 
dithp-related polynucleotide in a sample. This information is then compared to information obtained 
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from appropriate reference samples, and a diagnosis is established. Alternatively, a polynucleotide 
complementary to a given dithp can inhibit or inactivate a therapeutically relevant gene related to the 
dithp. 

-« 

} 5 Analysis of dithp Expression Patterns 

The expression of dithp may be routinely assessed by hybridization-based methods to 
determine, for example, the tissue-specificity, disease-specificity, or developmental stage-specificity of 
dithp expression- For example, the level of expression of dithp may be compared among different cell 
types or tissues, among diseased and normal cell types or tissues, among cell types or tissues at 

10 different developmental stages, or among cell types or tissues undergoing various treatments. This type 
of analysis is useful, for example, to assess the relative levels of dithp expression in hilly or partially 
differentiated cells or tissues, to determine if changes in dithp expression levels are correlated with the 
development or progression of specific disease states, and to assess the response of a cell or tissue to a 
specific therapy, for example, in pharmacological or toxicological studies. Methods for the analysis of 

15 dithp expression are based on hybridization and amplification technologies and include membrane- 
based procedures such as northern blot analysis, high-throughput procedures that utilize, for example, 
microarrays, and PCR-based procedures. 

Hybridization and Genetic Analysis 

20 The dithp, their fragments, or complementary sequences, may be used to identify the presence 

of and/or to determine the degree of similarity between two (or more) nucleic acid sequences. The dithp 
may be hybridized to naturally occurring or recombinant nucleic acid sequences under appropriately 
selected temperatures and salt concentrations. Hybridization with a probe based on the nucleic acid 
sequence of at least one of the dithp allows for the detection of nucleic acid sequences, including 

2 5 genomic sequences, which are identical or related to the dithp of the Sequence Listing. Probes may be 
selected from non-conserved or unique regions of at least one of the polynucleotides of SEQ ID NO:l- 
71 and tested for their ability to identify or amplify the target nucleic acid sequence using standard 
protocols. 

Polynucleotide sequences that are capable of hybridizing, in particular, to those shown in SEQ 
30 ID NO: 1-71 and fragments thereof, can be identified using various conditions of stringency. (See, e.g., 
Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods 
Enzymol. 152:507-51 1.) Hybridization conditions are discussed in "Definitions." 

A probe for use in Southern or northern hybridization may be derived from a fragment of a 
dithp sequence, or its complement, that is up to several hundred nucleotides in length and is either 
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single-stranded or double-stranded. Such probes may be hybridized in solution to biological materials 
such as plasmids, bacterial, yeast, or human artificial chromosomes, cleared or sectioned tissues, or to 
artificial substrates containing dithp. Microarrays are particularly suitable for identifying the presence 
of and detecting the level of expression for multiple genes of interest by examining gene expression 
correlated with, e.g., various stages of development, treatment with a drug or compound, or disease 
progression. An array analogous to a dot or slot blot may be used to arrange and link polynucleotides 
to the surface of a substrate using one or more of the following: mechanical (vacuum), chemical, 
thermal, or UV bonding procedures. Such an array may contain any number of dithp and may be 
produced by hand or by using available devices, materials, and machines. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

Probes may be labeled by either PCR or enzymatic techniques using a variety of commercially 
available reporter molecules. For example, commercial kits are available for radioactive and 
chemiluminescent labeling (Amersham Pharmacia Biotech) and for alkaline phosphatase labeling (Life 
Technologies). Alternatively, dithp may be cloned into commercially available vectors for the 
production of RNA probes. Such probes may be transcribed in the presence of at least one labeled 
nucleotide (e.g., 32 P-ATP, Amersham Pharmacia Biotech). 

Additionally the polynucleotides of SEQ ID NO: 1-71 or suitable fragments thereof can be used 
to isolate full length cDNA sequences utilizing hybridization and/or amplification procedures well 
known in the art, e.g., cDNA library screening, PCR amplification, etc. The molecular cloning of such 
full length cDNA sequences may employ the method of cDNA library screening with probes using the 
hybridization, stringency, washing, and probing strategies described above and in Ausubel, supra . 
Chapters 3, 5, and 6. These procedures may also be employed with genomic libraries to isolate 
genomic sequences of dithp in order to analyze, e.g„ regulatory elements. 

Genetic Mapping 

Gene identification and mapping are important in the investigation and treatment of almost all 
conditions, diseases, and disorders. Cancer, cardiovascular disease, Alzheimer's disease, arthritis, 
diabetes, and mental illnesses are of particular interest. Each of these conditions is more complex than 
the single gene defects of sickle cell anemia or cystic fibrosis, with select groups of genes being 
predictive of predisposition for a particular condition, disease, or disorder. For example, 
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cardiovascular disease may result from malfunctioning receptor molecules that fail to clear cholesterol 
from the bloodstream, and diabetes may result when a particular individual's immune system is 
activated by an infection and attacks the insulin-producing cells of the pancreas. In some studies, 
Alzheimer's disease has been linked to a gene on chromosome 21 ; other studies predict a different gene 
5 and location. Mapping of disease genes is a complex and reiterative process and generally proceeds 
from genetic linkage analysis to physical mapping. 

As a condition is noted among members of a family, a genetic linkage map traces parts of 
chromosomes that are inherited in the same pattern as the condition. Statistics link the inheritance of 
particular conditions to particular regions of chromosomes, as defined by RFLP or other markers. 

10 (See, for example, Lander, E. S. and Botstein, D. (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 
Occasionally, genetic markers and their locations are known from previous studies. More often, 
however, the markers are simply stretches of DNA that differ among individuals. Examples of genetic 
linkage maps can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. 

is In another embodiment of the invention, dithp sequences may be used to generate hybridization 

probes useful in chromosomal mapping of naturally occurring genomic sequences. Either coding or 
noncoding sequences of dithp may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a dithp coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 

2 o chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific region 

of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
25 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome 
mapping techniques and genetic map data. (See, e.g., Meyers, supra , pp. 965-968.) Correlation 
between the location of dithp on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that disorder. 

3 o The dithp sequences may also be used to detect polymorphisms that are genetically linked to the 

inheritance of a particular condition, disease, or disorder. 

In situ hybridization of chromosomal preparations and genetic mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending existing genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
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mouse, may reveal associated markers even if the number or arm of the corresponding human 
chromosome is not known. These new marker sequences can be mapped to human chromosomes and 
may provide valuable information to investigators searching for disease genes using positional cloning 
or other gene discovery techniques. Once a disease or syndrome has been crudely correlated by genetic 
5 linkage with a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 

mapping to that area may represent associated or regulatory genes for further investigatioa (See, e.g., 
Gatti, R. A. et al. (1988) Nature 336:577-580.) The nucleotide sequences of the subject invention may 
also be used to detect differences in chromosomal architecture due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

10 Once a disease-associated gene is mapped to a chromosomal region, the gene must be cloned in 

order to identify mutations or other alterations (e.g., translocations or inversions) that may be correlated 
with disease. This process requires a physical map of the chromosomal region containing the disease- 
gene of interest along with associated markers. A physical map is necessary for determining the 
nucleotide sequence of and order of marker genes on a particular chromosomal region. Physical 

15 mapping techniques are well known in the art and require the generation of overlapping sets of cloned 
DNA fragments from a particular organelle, chromosome, or genome. These clones are analyzed to 
reconstruct and catalog their order. Once the position of a marker is determined, the DNA from that 
region is obtained by consulting the catalog and selecting clones from that region. The gene of interest 
is located through positional cloning techniques using hybridization or similar methods. 

20 

Diagnostic Uses 

The dithp of the present invention may be used to design probes useful in diagnostic assays. 
Such assays, well known to those skilled in the art, may be used to detect or confirm conditions, 
disorders, or diseases associated with abnormal levels of dithp expression. Labeled probes developed 

2 5 from dithp sequences are added to a sample under hybridizing conditions of desired stringency. In some 
instances, dithp, or fragments or oligonucleotides derived from dithp, may be used as primers in 
amplification steps prior to hybridization. The amount of hybridization complex formed is quantified 
and compared with standards for that cell or tissue. If dithp expression varies significantly from the 
standard, the assay indicates the presence of the condition, disorder, or disease. Qualitative or 

30 quantitative diagnostic methods may include northern, dot blot, or other membrane or dip-stick based 
technologies or multiple-sample format technologies such as PCR, enzyme-linked immunosorbent assay 
(ELISA)-like, pin, or chip-based assays. 

The probes described above may also be used to monitor the progress of conditions, disorders, 
or diseases associated with abnormal levels of dithp expression, or to evaluate the efficacy of a 
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particular therapeutic treatment. The candidate probe may be identified from the dithp that are specific 
to a given human tissue and have not been observed in GenBank or other genome databases. Such a 
probe may be used in animal studies, preclinical tests, clinical trials, or in monitoring the treatment of 
an individual patient. In a typical process, standard expression is established by methods well known in 
5 the art for use as a basis of comparison, samples from patients affected by the disorder or disease are 
combined with the probe to evaluate any deviation from the standard profile, and a therapeutic agent is 
administered and effects are monitored to generate a treatment profile. Efficacy is evaluated by 
determining whether the expression progresses toward or returns to the standard normal pattern. 
Treatment profiles may be generated over a period of several days or several months. Statistical 

1 o methods well known to those skilled in the art may be use to determine the significance of such 

therapeutic agents. 

The polynucleotides are also useful for identifying individuals from minute biological samples, 
for example, by matching the RFLP patten of a sample's DNA to that of an individual's DNA. The 
polynucleotides of the present invention can also be used to determine the actual base-by-base DNA 

15 sequence of selected portions of an individual's genome. These sequences can be used to prepare PCR 
primers for amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, an individual can be identified through a unique set of DNA sequences. Once a unique ID 
database is established for an individual, positive identification of that individual can be made from 
extremely small tissue samples. 

20 In a particular aspect, oligonucleotide primers derived from the dithp of the invention may be 

used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and 
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of SNP 
detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and 
fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from dithp are used to 

2 5 amplify DNA using the polymerase chain reaction (PCR). The DNA may be derived, for example, 

from diseased or normal tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause 
diff erences in the secondary and tertiary structures of PCR products in single-stranded form, and these 
differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the 
oligonucleotide primers are fluorescently labeled, which allows detection of the amplimers in high- 
30 throughput equipment such as DNA sequencing machines. Additionally, sequence database analysis 
methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by comparing the 
sequences of individual overlapping DNA fragments which assemble into a common consensus 
sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
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chromatograms. In the alternative, SNPs may be detected and characterized by mass spectrometry 
using, for example, the high throughput MASS ARRAY system (Sequenom, Inc., San Diego CA). 

DNA-based identification techniques are critical in forensic technology. DNA sequences taken 
from very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, 
5 semen, etc., can be amplified using, e.g., PCR, to identify individuals. (See, e.g., Erlich, H. (1992) 
PCR Technology , Freeman and Co., New York, NY). Similarly, polynucleotides of the present 
invention can be used as polymorphic markers. 

There is also a need for reagents capable of identifying the source of a particular tissue. 
Appropriate reagents can comprise, for example, DNA probes or primers prepared from the sequences 
10 of the present invention that are specific for particular tissues. Panels of such reagents can identify 

tissue by species and/or by organ type. In a similar fashion, these reagents can be used to screen tissue 
cultures for contamination. 

The polynucleotides of the present invention can also be used as molecular weight markers on 
nucleic acid gels or Southern blots, as diagnostic probes for the presence of a specific mRNA in a 
1 5 particular cell type, in the creation of subtracted cDNA libraries which aid in the discovery of novel 

polynucleotides, in selection and synthesis of oligomers for attachment to an array or other support, and 
as an antigen to elicit an immune response. 

Disease Model Systems Using dithp 

2 o The dithp of the invention or their mammalian homologs may be "knocked out" in an animal 

model system using homologous recombination in embryonic stem (ES) cells. Such techniques are well 
known in the art and are useful for the generation of animal models of human disease. (See, e.g., U.S. 
Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For example, mouse ES cells, such as 
the mouse 129/SvJ cell line, are derived from the early mouse embryo and grown in culture. The ES 

2 5 cells are transformed with a vector containing the gene of interest disrupted by a marker gene, e.g., the 
neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 244:1288-1292). The vector 
integrates into the corresponding region of the host genome by homologous recombination. 
Alternatively, homologous recombination takes place using the Cre-loxP system to knockout a gene of 
interest in a tissue- or developmental stage-specific manner (Marth, J.D. (1996) Clin. Invest. 97:1999- 

30 2002; Wagner, K.U. et aL (1997) Nucleic Acids Res. 25:4323-4330). Transformed ES cells are 

identified and microinjected into mouse cell blastocysts such as those from the C57BL/6 mouse strain. 
The blastocysts are surgically transferred to pseudopregnant dams, and the resulting chimeric progeny 
are genotyped and bred to produce heterozygous or homozygous strains. Transgenic animals thus 
generated may be tested with potential therapeutic or toxic agents. 
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The dithp of the invention may also be manipulated in vitro in ES cells derived from human 
blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell lineages 
including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate into, for 
example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J. A. et al. (1998) Science 
5 282:1145-1147). 

The dithp of the invention can also be used to create "knockin" humanized animals (pigs) or 
transgenic animals (mice or rats) to model human disease. With knockin technology, a region of dithp 
is injected into animal ES cells, and the injected sequence integrates into the animal cell genome. 
Transformed cells are injected into blastulae, and the blastulae are implanted as described above. 
10 Transgenic progeny or inbred lines are studied and treated with potential pharmaceutical agents to 
obtain information on treatment of a human disease. Alternatively, a mammal inbred to overexpress 
dithp, resulting, e.g., in the secretion of DITHP in its milk, may also serve as a convenient source of 
that protein (Jamie, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 

is Screening Assays 

DITHP encoded by polynucleotides of the present invention may be used to screen for 
molecules that bind to or are bound by the encoded polypeptides. The binding of the polypeptide and 
the molecule may activate (agonist), increase, inhibit (antagonist), or decrease activity of the 
polypeptide or the bound molecule. Examples of such molecules include antibodies, oligonucleotides, 

20 proteins (e.g., receptors), or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, e.g., a ligand 
or fragment thereof, a natural substrate, or a structural or functional mimetic. (See, Coligan et al., 
(1991) Current Protocols in Immunology 1(2): Chapter 5.) Similarly, the molecule can be closely 
related to the natural receptor to which the polypeptide binds, or to at least a fragment of the receptor, 

2 5 e.g., the active site. In either case, the molecule can be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate cells which express the 
polypeptide, either as a secreted protein or on the cell membrane. Preferred cells include cells from 
mammals, yeast, Drosophila , or E. coli . Cells expressing the polypeptide or cell membrane fractions 
which contain the expressed polypeptide are then contacted with a test compound and binding, 

3 o stimulation, or inhibition of activity of either the polypeptide or the molecule is analyzed. 

An assay may simply test binding of a candidate compound to the polypeptide, wherein binding 
is detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. Alternatively, 
the assay may assess binding in the presence of a labeled competitor. 
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Additionally, the assay can be carried out using cell-free preparations, polypeptide/molecule 
affixed to a solid support, chemical libraries, or natural product mixtures. The assay may also simply 
comprise the steps of mixing a candidate compound with a solution containing a polypeptide, measuring 
polypeptide/molecule activity or binding, and comparing the polypeptide/molecule activity or binding to 
5 a standard. 

Preferably, an ELISA assay using, e.g., a monoclonal or polyclonal antibody, can measure 
polypeptide level in a sample. The antibody can measure polypeptide level by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of the above assays can be used in a diagnostic or prognostic context. The molecules 
10 discovered using these assays can be used to treat disease or to bring about a particular result in a 

patient (e.g., blood vessel growth) by activating or inhibiting the rxriypeptide/molecule. Moreover, the 
assays can discover agents which may inhibit or enhance the production of the polypeptide from 
suitably manipulated cells or tissues. 



15 Transcript Imaging and Toxicological Testing 

Another embodiment relates to the use of dithp to develop a transcript image of a tissue or cell 
type. A transcript image represents the global pattern of gene expression by a particular tissue or cell 
type. Global gene expression patterns are analyzed by quantifying the number of expressed genes and 
their relative abundance under given conditions and at a given time. (See Seilhamer et al., 

2 o "Comparative Gene Transcript Analysis," U.S. Patent Number 5,840,484, expressly incorporated by 
reference herein.) Thus a transcript image may be generated by hybridizing the polynucleotides of the 
present invention or their complements to the totality of transcripts or reverse transcripts of a particular 
tissue or cell type. In one embodiment, the hybridization takes place in high-throughput format, 
wherein the polynucleotides of the present invention or their complements comprise a subset of a 

2 5 plurality of elements on a microarray. The resultant transcript image would provide a profile of gene 
activity pertaining to human molecules for diagnostics and therapeutics. 

Transcript images which profile dithp expression may be generated using transcripts isolated 
from tissues, cell lines, biopsies, or other biological samples. The transcript image may thus reflect 
dithp expression in vivo , as in the case of a tissue or biopsy sample, or in vitro , as in the case of a cell 

30 line. 

Transcript images which profile dithp expression may also be used in conjunction with in vitro 
model systems and preclinical evaluation of pharmaceuticals, as well as toxicological testing of 
industrial and naturally-occurring environmental compounds. All compounds induce characteristic 
gene expression patterns, frequently tamed molecular fingerprints or toxicant signatures, which are 
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indicative of mechanisms of action and toxicity (Nuwaysir, E. F. et al. (1999) Mol. Carcinog. 24:153- 
159; Steiner, S. and Anderson, N. L. (2000) Toxicol. Lett. 112-113:467-71, expressly incorporated by 
reference herein). If a test compound has a signature similar to that of a compound with known 
toxicity, it is likely to share those toxic properties. These fingerprints or signatures are most useful and 
5 refined when they contain expression information from a large number of genes and gene families. 
Ideally, a genome- wide measurement of expression provides the highest quality signature. Even genes 
whose expression is not altered by any tested compounds are important as well, as the levels of 
expression of these genes are used to normalize the rest of the expression data. The normalization 
procedure is useful for comparison of expression data alto treatment with different compounds. While 

10 the assignment of gene function to elements of a toxicant signature aids in interpretation of toxicity 
mechanisms, knowledge of gene function is not necessary for the statistical matching of signatures 
which leads to prediction of toxicity. (See, for example, Press Release 00-02 from the Nations 
Institute of Environmental Health Sciences, released February 29, 2000, available at 
http^/www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is important and desirable in 

15 toxicological screening using toxicant signatures to include all expressed gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 

2 o invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of DITHP encoded by polynucleotides of the 
present invention to analyze the proteome of a tissue or cell type. The term proteome refers to the 

2 5 global pattern of protein expression in a particular tissue or cell type. Each protein component of a 

proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles, 
are analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 

3 o achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 

isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
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generally proportional to the level of the protein in the sample. The optical densities of equivalently 
positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment. The proteins in the spots are partially sequenced using, for 
5 example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for DITHP to quantify the 

10 levels of DITHP expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 
levels of protein bound to each array element (Lueking, A. et al. ( 1 999) Anal. Biocbem. 270: 1 03- 1 1 ; 
Mendoze, L. G. et al. (1999) Biotechniques 27:778-88). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 

1 5 reactive fluorescent compound and detecting the amount of fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N. L. and 
Seilhamer, J. (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 

20 analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 

2 5 sample are separated so that the amount of each protein can be quantified. The amount of each protein 

is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the DITHP encoded by polynucleotides of 

3 0 the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the DITHP encoded by polynucleotides of the present invention. The amount 
of protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
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sample is compared with the amount in an untreated biological sample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 
sample. 

Transcript images may be used to profile dithp expression in distinct tissue types. This process 
5 can be used to determine human molecule activity in a particular tissue type relative to this activity in a 
different tissue type. Transcript images may be used to generate a profile of dithp expression 
characteristic of diseased tissue. Transcript images of tissues before and after treatment may be used 
for diagnostic purposes, to monitor the progression of disease, and to monitor the efficacy of drug 
treatments for diseases which affect the activity of human molecules. 

10 Transcript images of cell lines can be used to assess human molecule activity and/or to identify 

cell lines that lack or misregulate this activity. Such cell lines may then be treated with pharmaceutical 
agents, and a transcript image following treatment may indicate the efficacy of these agents in restoring 
desired levels of this activity. A similar approach may be used to assess the toxicity of pharmaceutical 
agents as reflected by undesirable changes in human molecule activity. Candidate pharmaceutical 

15 agents may be evaluated by comparing their associated transcript images with those of pharmaceutical 
agents of known effectiveness. 

Antisense Molecules 

The polynucleotides of the present invention are useful in antisense technology. Antisense 
2 o technology or therapy relies on the modulation of expression of a target protein through the specific 
binding of an antisense sequence to a target sequence encoding the target protein or directing its 
expressioa (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc., Totawa 
NJ; Alama, A et al. (1997) Pharmacol. Res. 36(3):171-178; Crooke, S.T. (1997) Adv. Pharmacol. 
40:1-49; Sharma, H.W. and R. Narayanan (1995) Bioessays 17(12):1055-1063; andLavrosky, Y. et 
25 al. (1997) Biochem. Mol. Med. 62(1):1 1-22.) An antisense sequence is a polynucleotide sequence 
capable of specifically hybridizing to at least a portion of the target sequence. Antisense sequences 
bind to cellular mRNA and/or genomic DNA, affecting translation and/or transcription. Antisense 
sequences can be DNA, RNA, or nucleic acid mimics and analogs. (See, e.g., Rossi, J.J. et al. (1991) 
Antisense Res. Dev. l(3):285-288; Lee, R. et al. (1998) Biochemistry 37(3):900-1010; Pardridge, 
30 W.M. et al. (1995) Proc. Natl. Acad. Sci. USA 92(12):5592-5596; and Nielsen, P. E. and Haaima, G. 
(1997) Chem. Soc. Rev. 96:73-78.) Typically, the binding which results in modulation of expression 
occurs through hybridization or binding of complementary base pairs. Antisense sequences can also 
bind to DNA duplexes through specific interactions in the major groove of the double helix. 
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The polynucleotides of the present invention and fragments thereof can be used as antisense 
sequences to modify the expression of the polypeptide encoded by dithp. The antisense sequences can 
be produced ex vivo , such as by using any of the ABI nucleic acid synthesizer series (PE Biosystems) 
or other automated systems known in the art. Antisense sequences can also be produced biologically, 
such as by transforming an appropriate host cell with an expression vector containing the sequence of 
interest. (See, e.g., Agrawal, supra .) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense sequences 
into appropriate target cells can be used. Antisense sequences can be delivered intracellular^ in the 
form of an expression plasmid which, upon transcription, produces a sequence complementary to at 
least a portion of the cellular sequence encoding the target protein. (See, e.g., Slater, J.E., et al. (1998) 
J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J., et al. (1995) 9(13).1288-1296.) 
Antisense sequences can also be introduced intracellularly through the use of viral vectors, such as 
retrovirus and adencnassociated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 76:271; Ausubel, 
F.M. et al. (1995) Current Protocols in Molecular Biology , John Wiley & Sons, New York NY; Uckert, 
W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other gene delivery mechanisms include 
liposome-derived systems, artificial viral envelopes, and other systems known in the art. (See, e.g., 
Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et al. (1998) J. Pharm. SdL 87(11):1308- 
1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 25(14):2730-2736.) 

Expression 

In order to express a biologically active DITHP, the nucleotide sequences encoding DITHP or 
fragments thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding DITHP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DN A techniques, synthetic 
techniques, and in vivo genetic recombination. (See, e.g., Sambrook, supra , Chapters 4, 8, 16, and 17; 
and Ausubel, supra , Chapters 9, 10, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding DITHP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 



146 



WO 01/21836 PCT/US00/25643 



animal (mammalian) cell systems. (See, e.g., Sambrook, supra : Ausubel, 1995, supra . Van Heeke, G. 

and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 

153:516-544; Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) 

Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 7:1937-1945; 
5 Takamatsu, N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, 

R. et al. (1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; 

The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 

191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, 

J.J. et al. (1997) Nat Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, 
10 or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 

sequences to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) 

Cancer Gen. Thar. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad Sci. USA 90(13):6340-6344; 

Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 

31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not 
1 5 limited by the host cell employed. 

For long term production of recombinant proteins in mammalian systems, stable expression of 

DITHP in cell lines is preferred. For example, sequences encoding DITHP can be transformed into cell 

lines using expression vectors which may contain viral origins of replication and/or endogenous 

expression elements and a selectable marker gene on the same or on a separate vector. Any number of 
20 selection systems may be used to recover transformed cell lines. (See, e.g., Wigler, M. et al. (1977) 

Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.; Wigler, M. et al. (1980) Proc. Natl. Acad. 

Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14; Hartman, S.C. and 

R.C.Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:8047-8051; Rhodes, C.A. (1995) Methods Mol. 

Biol. 55:121-131.) 

25 

Therapeutic Uses of dithp 

The dithp of the invention may be used for somatic or germline gene therapy. Gene therapy 
may be performed to (i) correct a genetic deficiency (e.g., in the cases of severe combined 
immunodeficiency (SCID)-X1 disease characterized by X-linked inheritance (Cavazzana-Calvo, M. et 
30 al. (2000) Science 288:669-672), severe combined immunodeficiency syndrome associated with an 
inherited adenosine deaminase (ADA) deficiency (Blaese, R.M. et al. (1995) Science 270:475-480; 
Bordignon, C. et al. (1995) Science 270:470-475), cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207- 
216; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:667-703), thalassemias, familial hypercholesterolemia, and hemophilia resulting from Factor 
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VIII or Factor IX deficiencies (Crystal, R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. 
(1997) Nature 389:239-242)), (ii) express a conditionally lethal gene product (e.g., in the case of 
cancers which result from unregulated cell proliferation), or (iii) express a protein which affords 
protection against intracellular parasites (e.g., against human retroviruses, such as human 
5 immunodeficiency virus (HIV) (Baltimore, D. (1988) Nature 335:395-396; Poeschla, E. et al. (1996) 
Proc. Natl. Acad. Sci. USA. 93:11395-11399), hepatitis B or C virus (HBV, HCV); fungal parasites, 
such as Candida albicans and Paracoccidioides brasiliensis ; and protozoan parasites such as 
Plasmodium falciparum and Trypanosoma cruzi) . In the case where a genetic deficiency in dithp 
expression or regulation causes disease, the expression of dithp from an appropriate population of 

10 transduced cells may alleviate the clinical manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in dithp 
are treated by constructing mammalian expression vectors comprising dithp and introducing these 
vectors by mechanical means into dithp-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 

1 5 particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNAtransposons (Morgan, R.A. and Anderson, W.F. (1993) Annu. Rev. Biochem. 62:191-217; 
Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and R6cipon, H. (1998) Curr. Opin. Biotechnol. 9:445- 
450). 

Expression vectors that may be effective for the expression of dithp include, but are not limited 
20 to, the PCDNA 3.1 , EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). The dithp of the invention 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 
sarcoma virus (RSV), S V40 virus, thymidine kinase (TK), or 0-actin genes), (ii) an inducible promoter 
25 (e.g., the tetracycline-regulated promoter (Gossen, M. and Bujard, H. (1992) Proc. Natl. Acad. Sci. 
U.S.A. 89:5547-5551; Gossen, M. et al., (1995) Science 268:1766-1769; Rossi, F.M.V. and Blau, 
H.M. (1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid 
(Invitrogen); the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; 
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter 
30 (Rossi, F.M.V. and Blau, H.M. supra) , or (iii) a tissue-specific promoter or the native promoter of the 
endogenous gene encoding DITHP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
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parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. andEb, AJ. (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1 :841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to dithp expression are treated by constructing a retrovirus vector consisting of (i) dithp under 
the control of an independent promoter or the retrovirus long terminal repeat (LTR) promoter, (ii) 
appropriate RNA packaging signals, and (iii) a Rev-responsive element (RRE) along with additional 
retrovirus cis-acting RNA sequences and coding sequences required for efficient vector propagation. 
Retrovirus vectors (e.g., PFB and PFBNEO) are commercially available (Stratagene) and are based on 
published data (Riviere, I. et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 92:6733-6737), incorporated by 
reference herein. The vector is propagated in an appropriate vector producing cell line (VPCL) that 
expresses an envelope gene with a tropism for receptors on the target cells or a promiscuous envelope 
protein such as VSVg (Armentano, D. et al. (1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) 
J. Virol. 61:1639-1646; Adam, M.A. and Miller, A.D. (1988) J. Virol. 62:3802-3806; Dull, T. et al. 
(1998) J. Virol. 72:8463-8471; Zufferey, R. et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 
5,910,434 to Rigg ("Method for obtaining retrovirus packaging cell lines producing high transducing 
efficiency retroviral supernatant") discloses a method for obtaining retrovirus packaging cell lines and 
is hereby incorporated by reference. Propagation of retrovirus vectors, transduction of a population of 
cells (e.g., CD4 + T-cells), and the return of transduced cells to a patient are procedures well known to 
persons skilled in the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. 
Virol. 71:7020-7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 
71:4707-4716; Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 95:1201-1206; Su, L. (1997) 
Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver dithp to 
cells which have one or more genetic abnormalities with respect to the expression of dithp. The 
construction and packaging of adenovirus-based vectors are well known to those with ordinary skill in 
the art Replication defective adenovirus vectors have proven to be versatile for importing genes 
encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. et al. (1995) 
Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. Patent 
Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated by 
reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. Nutr. 19:51 1-544 
and Verma, I.M. and Somia, N. (1997) Nature 18:389:239-242, both incorporated by reference herein. 
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In another alternative, a herpes-based, gene therapy delivery system is used to deliver dithp to 
target cells which have one or more genetic abnormalities with respect to the expression of dithp. The 
use of herpes simplex virus (HS V)-based vectors may be especially valuable for introducing dithp to 
cells of the central nervous system, for which HSV has a tropism. The construction and packaging of 
5 herpes-based vectors are well known to those with ordinary skill in the art. A replication-competent 
herpes simplex virus (HSV) type 1 -based vector has been used to deliver a reporter gene to the eyes of 
primates (Liu, X. et al. (1999) Exp. Eye Res. 169:385-395). The construction of a HSV-1 virus vector 
has also been disclosed in detail in U.S. Patent Number 5,804,413 to DeLuca ("Herpes simplex virus 
strains for gene transfer"), which is hereby incorporated by reference. U.S. Patent Number 5,804,413 

10 teaches the use of recombinant HSV d92 which consists of a genome containing at least one exogenous 
gene to be transferred to a cell undo* the control of the appropriate promoter for purposes including 
human gene therapy. Also taught by this patent are the construction and use of recombinant HSV 
strains deleted for ICP4, ICP27 and ICP22. For HSV vectors, see also Goins, W. F. et al. 1999 J. 
Virol. 73:519-532 and Xu, H. et al., (1994) Dev. Biol. 163:152-161, hereby incorporated by reference. 

1 5 The manipulation of cloned herpesvirus sequences, the generation of recombinant virus following the 
transfection of multiple plasmids containing different segments of the large herpesvirus genomes, the 
growth and propagation of herpesvirus, and the infection of cells with herpesvirus are techniques well 
known to those of ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RN A virus) vector is used to 

2 o deliver dithp to target cells. The biology of the prototypic alphavirus, Semliki Forest Virus (SFV), has 
been studied extensively and gene transfer vectors have been based on the SFV genome (Garoff, H. and 
Li, K-J. (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus RN A replication, a subgenomic 
RNA is generated that normally encodes the viral capsid proteins. This subgenomic RNA replicates to 
higher levels than the full-length genomic RNA, resulting in the overproduction of capsid proteins 

2 5 relative to the viral proteins with enzymatic activity (e.g. , protease and polymerase). Similarly, 

inserting dithp into the alphavirus genome in place of the capsid-coding region results in the production 
of a large number of dithp RNAs and the synthesis of high levels of DITHP in vector transduced cells. 
While alphavirus infection is typically associated with cell lysis within a few days, the ability to 
establish a persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus 

3 o (SIN) indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene 

therapy application (Dryga, S. A. et al. (1997) Virology 228:74-83). The wide host range of 
alphaviruses will allow the introduction of dithp into a variety of cell types. The specific transduction 
of a subset of cells in a population may require the sorting of cells prior to transduction. The methods 
of manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
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transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

Antibodies 

Anti-DITHP antibodies may be used to analyze protein expression levels. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, and Fab fragments. For 
descriptions of and protocols of antibody technologies, see, e.g., Pound J.D. (1998) Immunochemical 
Protocols , Humana Press, Totowa, NJ. 

The amino acid sequence encoded by the dithp of the Sequence Listing may be analyzed by 
appropriate software (e.g., LASERGENE NAVIGATOR software, DNASTAR) to determine regions 
of high immunogenicity . The optimal sequences for immunization are selected from the C -terminus, the 
N-terminus, and those intervening, hydrophilic regions of the polypeptide which are likely to be exposed 
to the external environment when the polypeptide is in its natural conformation. Analysis used to select 
appropriate epitopes is also described by Ausubel (1997, supra , Chapter 1 1 .7). Peptides used for 
antibody induction do not need to have biological activity; however, they must be antigenic. Peptides 
used to induce specific antibodies may have an amino acid sequence consisting of at five amino acids, 
preferably at least 10 amino acids, and most preferably 15 amino acids. A peptide which mimics an 
antigenic fragment of the natural polypeptide may be fused with another protein such as keyhole limpet 
cyanin (KLH; Sigma, St. Louis MO) for antibody production. A peptide encompassing an antigenic 
region may be expressed from a dithp, synthesized as described above, or purified from human cells. 

Procedures well known in the art may be used for the production of antibodies. Various hosts 
including mice, goats, and rabbits, may be immunized by injection with a peptide. Depending on the 
host species, various adjuvants may be used to increase immunological response. 

In one procedure, peptides about 15 residues in length may be synthesized using an ABI 431 A 
peptide synthesizer (PE Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by reaction 
with M-maleimidobenzoyl-N-hydroxysuccinimide ester (Ausubel, 1995, supra) . Rabbits are 
immunized with the peptide-KLH complex in complete Freund's adjuvant. The resulting antisera are 
tested for antipeptide activity by binding the peptide to plastic, blocking with 1 % bovine serum albumin 
(BS A), reacting with rabbit antisera, washing, and reacting with radioiodinated goat anti-rabbit IgG. 
Antisera with antipeptide activity are tested for anti-DITHP activity using protocols well known in the 
art, including ELISA, radioimmunoassay (RIA), and immunoblotting. 

In another procedure, isolated and purified peptide may be used to immunize mice (about 100 
jig of peptide) or rabbits (about 1 mg of peptide). Subsequently, the peptide is radioiodinated and used 
to screen the immunized animals' B-lymphocytes for production of antipeptide antibodies. Positive 
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cells are then used to produce hybridomas using standard techniques. About 20 mg of peptide is 
sufficient for labeling and screening several thousand clones. Hybridomas of interest are detected by 
screening with radioiodinated peptide to identify those fusions producing peptide-specific monoclonal 
antibody. In a typical protocol, wells of a multi-well plate (FAST, Becton-Dickinson, Palo Alto, C A) 
5 are coated with affinity-purified, specific rabbit-anti-mouse (or suitable anti-species IgG) antibodies at 
10 mg/ml. The coated wells are blocked with 1 % BSA and washed and exposed to supernatants from 
hybridomas. After incubation, the wells are exposed to radiolabeled peptide at 1 mg/ml. 

Clones producing antibodies bind a quantity of labeled peptide that is detectable above 
background. Such clones are expanded and subjected to 2 cycles of cloning. Cloned hybridomas are 

10 injected into pristane-treated mice to produce ascites, and monoclonal antibody is purified from the 
ascitic fluid by affinity chromatography on protein A (Amersham Pharmacia Biotech). Several 
procedures for the production of monoclonal antibodies, including in vitro production, are described in 
Pound (supra) . Monoclonal antibodies with antipeptide activity are tested for anti-DITHP activity 
using protocols well known in the art, including ELISA, RI A, and immunoblotting. 

15 Antibody fragments containing specific binding sites for an epitope may also be generated. For 

example, such fragments include, but are not limited to, the F(ab')2 fragments produced by pepsin 
digestion of the antibody molecule, and the Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, construction of Fab expression libraries in filamentous 
bacteriophage allows rapid and easy identification of monoclonal fragments with desired specificity 

20 (Pound, supra . Chaps. 45-47). Antibodies generated against polypeptide encoded by dithp can be used 
to purify and characterize full-length DITHP protein and its activity, binding partners, etc. 

Assays Using Antibodies 

Anti-DITHP antibodies may be used in assays to quantify the amount of DITHP found in a 

2 5 particular human cell. Such assays include methods utilizing the antibody and a label to detect 

expression level under normal or disease conditions. The peptides and antibodies of the invention may 
be used with or without modification or labeled by joining them, either covalently or noncovalently, 
with a reporter molecule. 

Protocols for detecting and measuring protein expression using either polyclonal or monoclonal 

3 o antibodies are well known in the art. Examples include ELISA RIA, and fluorescent activated cell 

sorting (FACS). Such immunoassays typically involve the formation of complexes between the DITHP 
and its specific antibody and the measurement of such complexes. These and other assays are described 
in Pound (supra) . 
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Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of 
the disclosure in any way whatsoever. 
5 The disclosures of all patents, applications, and publications mentioned above and below, in 

particular U.S. Ser. No. 60/156,294, U.S. Ser. No. 60/155,760, U.S. Ser. No. 60/155,939, U.S. Ser. 
No. 60/156,565, U.S. Ser. No. 60/156,624, U.S. Ser. No. 60/156,625, U.S. Ser. No. 60/167,542, U.S. 
So-. No. 60/167,522, U.S. Ser. No. 60/167,453, U.S. Ser. No. 60/167,517, U.S. Ser. No. 60/167,943, 
U.S. Ser. No. 60/167,945, U.S. Ser. No. 60/167,520, U.S. Ser. No. 60/168,468, U.S. Ser. No. 
10 60/168,599, U.S. Ser. No. 60/167,410, U.S. Ser. No. 60/168,265, U.S. Ser. No. 60/168,429, U.S. Ser. 
No. 60/168,432, U.S. Ser. No. 60/167,521, U.S. Ser. No. 60/168,857, U.S. Ser. No. 60/168,197, U.S. 
Ser. No. 60/168,611, and U.S. Ser. No. 60/168,613 are hereby expressly incorporated by reference. 

EXAMPLES 

15 I. Construction of cDNA Libraries 

RNA was purchased from CLONTECH Laboratories, Inc. (Palo Alto CA) or isolated from 
various tissues. Some tissues were homogenized and lysed in guanidinium isothiocyanate, while others 
were homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates 

2 o were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In most cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega Corporation (Promega), Madison WI), 

2 5 OLIGOTEX latex particles (QIAGEN, Inc. (QIAGEN), Valencia CA), or an OLIGOTEX mRNA 
purification kit (QIAGEN). Alternatively, RNA was isolated directly from tissue lysates using other 
RNA isolation kits, e.g., the POLY(A)PURE mRNA purification kit (Ambion, Inc., Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 

30 vector system (Stratagene Cloning Systems, Inc. (Stratagene), La Jolla CA) or SUPERSCRIPT 

plasmid system (Life Technologies), using the recommended procedures or similar methods known in 
the art. (See, e.g., Ausubel, 1997, supra . Chapters 5.1 through 6.6.) Reverse transcription was 
initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were ligated to double 
stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme or enzymes. For 
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most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL S1000, SEPHAROSE 
CL2B, or SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or 
preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of 
the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), pSPORTl plasmid 
(Life Technologies), or pINC Y (Incyte). Recombinant plasmids were transformed into competent E. 
coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DH10B, or 
ElectroMAX DH10B from Life Technologies. 

II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system 
(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: the Magic or 
WIZARD Minipreps DNA purification system (Promega); the AGTC Miniprep purification kit (Edge 
BioSystems, Gaithersburg MD); and the QIAWELL 8, QIAWELL 8 Plus, and QIAWELL 8 Ultra 
plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit (QIAGEN). Following 
precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or without 
lyophilization, at4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format. (Rao, V.B. (1994) Anal. Biochem. 216:1-14.) Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Inc. (Molecular Probes), Eugene OR) and a FLUOROSKAN II 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 

TIL Sequencing and Analysis 

cDN A sequencing reactions were processed using standard methods or high-throughput 
instrumentation such as the ABI CATALYST 800 thermal cycler (PE Biosystems) or the PTC-200 
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific 
Corp., Sunnyvale CA) or the MTCROLAB 2200 liquid transfer system (Hamilton). cDNA sequencing 
reactions were prepared using reagents provided by Amersham Pharmacia Biotech or supplied in ABI 
sequencing kits such as the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE 
Biosystems). Electrophoretic separation of cDNA sequencing reactions and detection of labeled 
polynucleotides were carried out using the MEGAB ACE 1000 DNA sequencing system (Molecular 
Dynamics); the ABI PRISM 373 or 377 sequencing system (PE Biosystems) in conjunction with 
standard ABI protocols and base calling software; or other sequence analysis systems known in the art. 
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Reading frames within the cDN A sequences were identified using standard methods (reviewed in 
Ausubel, 1997, supra . Chapter 7.7). Some of the cDNA sequences were selected for extension using 
the techniques disclosed in Example VIII. 

IV. Assembly and Analysis of Sequences 

Component sequences from chromatograms were subject to PHRED analysis and assigned a 
quality score. The sequences having at least a required quality score were subject to various pre- 
processing editing pathways to eliminate, e.g., low quality 3* ends, vector and linker sequences, polyA 
tails, Alu repeats, mitochondrial and ribosomal sequences, bacterial contamination sequences, and 
sequences smaller than 50 base pairs. In particular, low-information sequences and repetitive elements 
(e.g., dinucleotide repeats, Alu repeats, etc.) were replaced by "n's", or masked, to prevent spurious 
matches. 

Processed sequences were then subject to assembly procedures in which the sequences were 
assigned to gene bins (bins). Each sequence could only belong to one bin. Sequences in each gene bin 
were assembled to produce consensus sequences (templates). Subsequent new sequences were added to 
existing bins using BLASTn (v. 1.4 WashU) and CROSSMATCH. Candidate pairs ware identified as 
all BLAST hits having a quality score greater than or equal to 150. Alignments of at least 82% local 
identity were accepted into the bin. The component sequences from each bin were assembled using a 
version of PHRAP. Bins with several overlapping component sequences were assembled using DEEP 
PHRAP. The orientation (sense or antisense) of each assembled template was determined based on the 
number and orientation of its component sequences. Template sequences as disclosed in the sequence 
listing correspond to sense strand sequences (the "forward" reading frames), to the best determination. 
The complementary (antisense) strands are inherently disclosed herein- The component sequences 
which were used to assemble each template consensus sequence are listed in Table 4, along with their 
positions along the template nucleotide sequences. 

Bins were compared against each other and those having local similarity of at least 82% were 
combined and reassembled. Reassembled bins having templates of insufficient overlap (less than 95% 
local identity) were re-split. Assembled templates were also subject to analysis by STITCHER/EXON 
MAPPER algorithms which analyze the probabilities of the presence of splice variants, alternatively 
spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or 
disease states, etc. These resulting bins were subject to several rounds of the above assembly 
procedures. 

Once gene bins were generated based upon sequence alignments, bins were clone joined based 
upon clone information. If the 5* sequence of one clone was present in one bin and the 3' sequence from 
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the same clone was present in a different bin, it was likely that the two bins actually belonged together 
in a single bin. The resulting combined bins underwent assembly procedures to regenerate the 
consensus sequences. 

The final assembled templates were subsequently annotated using the following procedure. 
Template sequences were analyzed using BLASTn (v2.0, NCBI) versus gbpri (GenBank version 118). 
"Hits'* were defined as an exact match having from 95% local identity over 200 base pairs through 
100% local identity over 100 base pairs, or a homolog match having an E-value, i.e. a probability 
score, of £ 1 x 10 8 . The hits were subject to frameshift FASTx versus GENPEPT (GenBank version 
1 1 8). (See Table 6). In this analysis, a homolog match was defined as having an E-value of <, 1 x 10*. 
The assembly method used above was described in "System and Methods for Analyzing Biomolecular 
Sequences," U.S.S.N. 09/276,534, filed March 25, 1999, and the LIFESEQ Gold user manual (Incyte) 
both incorporated by reference herein. 

Following assembly, template sequences were subjected to motif, BLAST, and functional 
analyses, and categorized in protein hierarchies using methods described in, e.g., "Database System 
Employing Protein Function Hierarchies for Viewing Biomolecular Sequence Data," U.S.S.N. 
08/812,290, filed March 6, 1997; "Relational Database for Storing Biomolecule Information,'* 
U.S.S.N. 08/947,845, filed October 9, 1997; "Project-Based Full-Length Biomolecular Sequence 
Database," U.S.S.N. 08/81 1,758, filed March 6, 1997; and "Relational Database and System for 
Storing Information Relating to Biomolecular Sequences," U.S.S.N. 09/034,807, filed March 4, 1998, 
all of which are incorporated by reference herein. 

The template sequences were further analyzed by translating each template in all three forward 
reading frames and searching each translation against the Pf am database of hidden Markov model- 
based protein families and domains using the HMMER software package (available to the public from 
Washington University School of Medicine, St Louis MO). Regions of templates which, when 
translated, contain similarity to Pfam consensus sequences are reported in Table 2, along with 
descriptions of Pfam protein domains and families. Only those Pfam hits with an E-value of £ 1 x 10 3 
are reported. (See also World Wide Web site http://pfamwustl.edu/ for detailed descriptions of Pfam 
protein domains and families.) 

Additionally, the template sequences were translated in all three forward reading frames, and 
each translation was searched against hidden Markov models for signal peptide and transmembrane 
domains using the HMMER software package. Construction of hidden Markov models and their usage 
in sequence analysis has been described. (See, for example, Eddy, S.R. (1996) Curr. Opin. Str. Biol. 
6:361-365.) Regions of templates which, when translated, contain similarity to signal peptide or 
transmembrane domain consensus sequences are reported in Table 3. Only those signal peptide or 
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transmembrane hits with a cutoff score of 1 1 bits or greater are reported. A cutoff score of 1 1 bits or 
greater corresponds to at least about 91-94% true-positives in signal peptide prediction, and at least 
about 75% true-positives in transmembrane domain prediction. 

The results of HMMER analysis as reported in Tables 2 and 3 may support the results of 
5 BLAST analysis as reported in Table 1 or may suggest alternative or additional properties of template- 
encoded polypeptides not previously uncovered by BLAST or other analyses. 

Template sequences are further analyzed using the bioinformatics tools listed in Table 6, or 
using sequence analysis software known in the art such as MACDNASIS PRO software (Hitachi 
Software Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Template 
10 sequences may be further queried against public databases such as the GenBank rodent, mammalian, 
vertebrate, prokaryote, and eukaryote databases. 

V. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 
1 5 and involves the hybridization of a labeled nucleotide sequence to a membrane on which RN As from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 
supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 

molecules in cDNA databases such as GenBank or LDFESEQ (Incyte Genomics). This analysis is 

2 o much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 

search can be modified to determine whether any particular match is categorized as exact or similar. 

* 

The basis of the search is the product score, which is defined as: 

BLAST Score x Pa-cent Identity 

25 5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
30 by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 

assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 



157 



WO 01/21836 



PCT/US00/25643 



example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

VI. Tissue Distribution Profiling 

A tissue distribution profile is determined for each template by compiling the cDNA library 
tissue classifications of its component cDNA sequences. Each component sequence, is derived from a 
cDNA library constructed from a human tissue. Each human tissue is classified into one of the 
following categories: cardiovascular system; connective tissue; digestive system; embryonic structures; 
endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; hemic and immune 
system; liver; musculoskeletal system; nervous system; pancreas; respiratory system; sense organs; 
skin; stomatognathic system; unclassified/mixed; or urinary tract. Template sequences, component 
sequences, and cDNA library/tissue information are found in the LIFESEQ GOLD database (Incyte 
Genomics, Palo Alto CA). 

Table 5 shows the tissue distribution profile for the templates of the invention. For each 
template, the three most frequently observed tissue categories are shown in column 3, along with the 
percentage of component sequences belonging to each category. Only tissue categories with 
percentage values of £ 10% are shown. A tissue distribution of "widely distributed" in column 3 
indicates percentage values of <10% in all tissue categories. 

VII. Transcript Image Analysis 

Transcript images are generated as described in Seilhamer et al., "Comparative Gene 
Transcript Analysis," U.S. Patent Number 5,840,484, incorporated herein by reference. 

VIII. Extension of Polynucleotide Sequences and Isolation of a Full-length cDNA 

Oligonucleotide primers designed using a dithp of the Sequence Listing are used to extend the 
nucleic acid sequence. One primer is synthesized to initiate 5* extension of the template, and the other 
prima:, to initiate 3' extension of the template. The initial primers may be designed using OLIGO 4.06 
software (National Biosciences, Inc. (National Biosciences), Plymouth MN), or another appropriate 
program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to 
anneal to the target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides 
which would result in hairpin structures and primer-primer dimerizations are avoided. Selected human 
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cDNA libraries are used to extend the sequence. If more than one extension is necessary or desired, 
additional or nested sets of primers are designed. 

High fidelity amplification is obtained by PCR using methods well known in the art. PCR is 
performed in 96-well plates using the PTC-200 thermal cycler (MJ Research). The reaction mix 
contains DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , and B- 
mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme (Life 
Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for prima- pair 
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ are as follows: Step 1 : 94 °C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well is determined by dispensing 100 \x\ PICOGREEN 
quantitation reagent (0.25% (v/v); Molecular Probes) dissolved in IX Tris-EDTA (TE) and 0.5 jil of 
undiluted PCR product into each well of an opaque fluorimeter plate (Corning Incorporated (Corning), 
Corning NY), allowing the DNA to bind to the reagent The plate is scanned in a FLUOROSKAN II 
(Labsystems Oy) to measure the fluorescence of the sample and to quantify the concentration of DNA. 
A 5 \i\ to 10 |il aliquot of the reaction mixture is analyzed by electrophoresis on a 1 % agarose mini-gel 
to determine which reactions are successful in extending the sequence. 

The extended nucleotides are desalted and concentrated, transferred to 384-well plates, 
digested with CylFI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides are separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments are excised, and agar digested with AGAR ACE (Promega). Extended clones are 
religated using T4 ligase (New England Biolabs, Inc., Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells are selected on antibiotic-containing 
media, individual colonies are picked and cultured overnight at 37 °C in 384-well plates in LB/2x 
carbenicillin liquid media. 

The cells are lysed, and DNA is amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 
94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA is quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries are reamplified 
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using the same conditions as described above. Samples are diluted with 20% dimcthysulfoxide (1 :2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (PE Biosystems). 

In like manner, the dithp is used to obtain regulatory sequences (promoters, introns, and 
enhancers) using the procedure above, oligonucleotides designed for such extension, and an appropriate 
genomic library. 

IX. Labeling of Probes and Southern Hybridization Analyses 

Hybridization probes derived from the dithp of the Sequence Listing are employed for 
screening cDNAs, mRNAs, or genomic DNA. The labeling of probe nucleotides between 100 and 
1000 nucleotides in length is specifically described, but essentially the same procedure may be used 
with largo* cDNA fragments. Probe sequences are labeled at room temperature for 30 minutes using a 
T4 polynucleotide kinase, y^P-ATP, and 0.5X One-Phor-All Plus (Amersham Pharmacia Biotech) 
buffo* and purified using a ProbeQuant G-50 Microcolumn (Amersham Pharmacia Biotech). The 
probe mixture is diluted to 10 7 dpm/^g/ml hybridization buffer and used in a typical membrane-based 
hybridization analysis. 

The DNA is digested with a restriction endonuclease such as Eco R V and is electrophoresed 
through a 0.7% agarose gel. The DNA fragments are transferred from the agarose to nylon membrane 
(NYTRAN Plus, Schleicher & Schuell, Inc., Keene NH) using procedures specified by the 
manufacturer of the membrane. Prehybridization is carried out for three or more hours at 68 °C, and 
hybridization is carried out overnight at 68 °C. To remove non-specific signals, blots are sequentially 
washed at room temperature under increasingly stringent conditions, up to 0.1 x saline sodium citrate 
(SSC) and 0.5% sodium dodecyl sulfate. After the blots are placed in a PHOSPHORIMAGER cassette 
(Molecular Dynamics) or are exposed to autoradiography film, hybridization patterns of standard and 
experimental lanes are compared. Essentially the same procedure is employed when screening RNA. 

X. Chromosome Mapping of dithp 

The cDNA sequences which were used to assemble SEQ ID NO: 1-71 are compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that match SEQ 
ID NO: 1-71 are assembled into clusters of contiguous and ova-lapping sequences using assembly 
algorithms such as PHRAP (Table 6). Radiation hybrid and genetic mapping data available from 
public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for Genome 
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Research (WIGR), and G6n&hon are used to determine if any of the clustered sequences have been 
previously mapped. Inclusion of a mapped sequence in a cluster will result in the assignment of all 
sequences of that cluster, including its particular SEQ ID NO:, to that map location. The genetic map 
locations of SEQ ID NO: 1-71 are described as ranges, or intervals, of human chromosomes. The map 
5 position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances 
are based on genetic markers mapped by G6n&hon which provide boundaries for radiation hybrid 

1 o markers whose sequences were included in each of the clusters. 

XI. Microarray Analysis 

Probe Preparation from Tissue or Cell Samples 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

1 5 polyA* RNA is purified using the oligo (dT) cellulose method. Each polyA + RNA sample is reverse 
transcribed using MMLV reverse-transcriptase, 0.05 pg/^1 oligo-dT primer (21mer), IX first strand 
buffer, 0.03 units/^il RNase inhibitor, 500 \xM dATP, 500 \xM dGTP, 500 \xM dTTP, 40 |iM dCTP, 
40 |iM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse transcription 
reaction is performed in a 25 ml volume containing 200 ng polyA + RNA with GEMBRIGHT kits 

20 (Incyte). Specific control polyA + RNAs are synthesized by in vitro transcription from non-coding yeast 
genomic DNA (W. Lei, unpublished). As quantitative controls, the control mRNAs at 0.002 ng, 0.02 
ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction at ratios of 1 :100,000, 1 :10,000, 
1:1000, 1:100 (w/w) to sample mRNA respectively . The control mRNAs are diluted into reverse 
transcription reaction at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, 25:1 (w/w) to sample mRNA differential 

2 5 expression patterns. After incubation at 37° C for 2 hr, each reaction sample (one with Cy3 and another 

with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 20 minutes at 
85° C to the stop the reaction and degrade the RNA Probes are purified using two successive 
CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. (CLONTECH), Palo 
Alto CA) and after combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 
2 o mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The probe is then dried to completion 
using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 ^il 5X SSC/0.2% 
SDS. 

Microarray Preparation 
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Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDN A insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 Mg. 
5 Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR Scientific 
Products Corporation (VWR), West Chester, PA), washed extensively in distilled water, and coated 
10 with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C ovea 
Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 \xl of the array element DNA, at an average 
concentration of 100 ng/nl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 
1 5 Microarrays are UV-crosslinked using a STR ATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford, MA) for 30 minutes at 60° C followed by washes in 0.2% 
SDS and distilled water as before. 

20 

Hybridization 

Hybridization reactions contain 9 jil of probe mixture consisting of 0.2 \xg each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The probe mixture 
is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with an 1 .8 
2 5 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly larger 
than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 *il of 
5x SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 6.5 
hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (IX SSC, 0.1% SDS), 
three times for 10 minutes each at 45° C in a second wash buffer (0. IX SSC), and dried. 

30 

Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
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focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDN A control species added to the probe mix at a known concentration. A specific location on the 
array contains a complementary DN A sequence, allowing the intensity of the signal at that location to 
be correlated with a weight ratio of hybridizing species of 1 : 100,000. When two probes from different 
sources (e.g., representing test and control cells), each labeled with a different fluorophore, are 
hybridized to a single array for the purpose of identifying genes that are differentially expressed, the 
calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and adding 
identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood, MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated to 
obtain a numerical value corresponding to the average intensity of the signal. The software used for 
signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

XII. Complementary Nucleic Acids 

Sequences complementary to the dithp are used to detect, decrease, or inhibit expression of the 
naturally occurring nucleotide. The use of oligonucleotides comprising from about 15 to 30 base pairs 
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is typical in the art. However, smaller or larger sequence fragments can also be used. Appropriate 
oligonucleotides are designed from the dithp using OLIGO 4.06 software (National Biosciences) or 
other appropriate programs and are synthesized using methods standard in the art or ordered from a 
commercial supplier. To inhibit transcription, a complementary oligonucleotide is designed from the 
5 most unique 5 * sequence and used to prevent transcription factor binding to the promoter sequence. To 
inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal binding and 
processing of the transcript. 

XIII. Expression of DITHP 

10 Expression and purification of DITHP is accomplished using bacterial or virus-based 

expression systems. For expression of DITHP in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels of 
cDN A transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) 
hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator 

1 5 regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., 

BL21(DE3). Antibiotic resistant bacteria express DITHP upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of DITHP in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 

20 replaced with cDNA encoding DITHP by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See e.g., Engelhard, 

25 supra : and Sandig, supra .) 

In most expression systems, DITHP is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicunu enables the purification of fusion proteins on immobilized 

3 o glutathione under conditions that maintain protein activity and antigenicity ( Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from DITHP at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak 
Company, Rochester NY). 6-His, a stretch of six consecutive histidine residues, enables purification on 
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metal-chelate resins (QIAGEN). Methods for protein expression and purification are discussed in 
Ausubel (1995, supra . Chapters 10 and 16). Purified DITHP obtained by these methods can be used 
directly in the following activity assay. 

5 XIV. Demonstration of DITHP Activity 

DITHP activity is demonstrated through a variety of specific assays, some of which are 
outlined below. 

Oxidoreductase activity of DITHP is measured by the increase in extinction coefficient of 
NAD(P)H coenzyme at 340 nmfor the measurement of oxidation activity, or the decrease in extinction 

l o coefficient of NAD(P)H coenzyme at 340 nmfor the measurement of reduction activity (Dalziel, K. 

(1963) J. Biol. Chem. 238:2850-2858). One of three substrates may be used: Asn-PGal, biocytidine, or 
ubiquinone- 10. The respective subunits of the enzyme reaction, for example, cytochtome c t -b 
oxidoreductase and cytochrome c, are reconstituted. The reaction mixture contains a) 1-2 mg/ml 
DITHP; and b) 15 mM substrate, 2.4 mM NAD(P) + in 0.1 M phosphate buffer, pH 7.1 (oxidation 

15 reaction), or 2.0 mM NAD(P)H, in 0.1 M Na2HP0 4 buffer, pH 7.4 ( reduction reaction); in a total 

volume of 0. 1 ml. Changes in absorbance at 340 nm (A^q) are measured at 23.5 ° C using a recording 
spectrophotometer (Shimadzu Scientific Instruments, Inc., Pleasanton CA). The amount of NAD(P)H 
is stoichiometrically equivalent to the amount of substrate initially present, and the change in ^ 40 is a 
direct measure of the amount of NAD(P)H produced; AA^o = 6620[NADH]. Oxidoreductase activity 

20 of DITHP activity is proportional to the amount of NAD(P)H present in the assay. 

Transferase activity of DITHP is measured through assays such as a methyl transferase assay 
in which the transfer of radiolabeled methyl groups between a donor substrate and an acceptor 
substrate is measured (Bokar, J.A. et al. (1994) J. Biol. Chem. 269:17697-17704). Reaction mixtures 
(50 |Ld final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM dithiothreitol, 3% 

25 polyvinylalcohol, 1.5 nCi [™?tfry/- 3 H]AdoMet (0.375 |iM AdoMet) (DuPont-NEN)> 0.6 jig DITHP, 
and acceptor substrate (0.4 ng [ 35 S]RNA or 6-mercaptopurine (6-MP) to 1 mM final concentration). 
Reaction mixtures are incubated at 30°C for 30 minutes, then 65 P C for 5 minutes. The products are 
separated by chromatography or electrophoresis and the level of methyl transferase activity is 
determined by quantification of methyl- 3 H recovery. 

30 DITHP hydrolase activity is measured by the hydrolysis of appropriate synthetic peptide 

substrates conjugated with various chromogenic molecules in which the degree of hydrolysis is 
quantified by spectrophotometric (or fluorometric) absorption of the released chromophore. (Beynon, 
R.J. and J.S. Bond (1994) Proteolytic Enzymes: A Practical Approach, Oxford University Press, New 
York NY, pp. 25-55) Peptide substrates are designed according to the category of protease activity as 
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endopeptidase (serine, cysteine, aspartic proteases), animopeptidase (leucine aminopeptidase), or 
carboxypeptidase (Carboxypeptidase A and B, procollagen C-proteinase). 

DITHP isomerase activity such as peptidyl prolyl cis/trans isomerase activity can be assayed 
by an enzyme assay described by Rahfeld, J.U., et al. (1994) (FEBS Lett 352:180-184). The assay 
5 is performed at 10°C in 35 mM HEPES buffer, pH 7.8, containing chymotrypsin (0.5 mg/ml) and 
DITHP at a variety of concentrations. Under these assay conditions, the substrate, Suc-Ala-Xaa-Pro- 
Phe-4-NA, is in equilibrium with respect to the prolyl bond, with 80-95% in trans and 5-20% in cis 
conformation. An aliquot (2 ul) of the substrate dissolved in dimethyl sulfoxide (10 mg/ml) is added 
to the reaction mixture described above. Only the cis isomer of the substrate is a substrate for 

10 cleavage by chymotrypsin. Thus, as the substrate is isomerized by DITHP, the product is cleaved by 
chymotrypsin to produce 4-nitroanilide, which is detected by it's absorbance at 390 nm. 4- 
Nitroanilide appears in a time-dependent and a DITHP concentration-dependent manner. 

An assay for DITHP activity associated with growth and development measures cell 
proliferation as the amount of newly initiated DNA synthesis in Swiss mouse 3T3 cells. A plasmid 

15 containing polynucleotides encoding DITHP is transfected into quiescent 3T3 cultured cells using 

methods well known in the art. The transiently transfected cells are then incubated in the presence of 
[ 3 H]thymidine, a radioactive DNA precursor. Where applicable, varying amounts of DITHP ligand are 
added to the transfected cells. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured 
over an appropriate time interval, and the amount incorporated is directly proportional to the amount of 

20 newly synthesized DNA 

Growth factor activity of DITHP is measured by the stimulation of DNA synthesis in Swiss 
mouse 3T3 cells (McKay, I. and I. Leigh, eds. (1993) Growth Factors: A Practical Approach , Oxford 
University Press, New York NY). Initiation of DNA synthesis indicates the cells' entry into the mitotic 
cycle and their commitment to undergo later division. 3T3 cells are competent to respond to most 

25 growth factors, not only those that are mitogenic, but also those that are involved in embryonic 

induction. This competence is possible because the in vivo specificity demonstrated by some growth 
factors is not necessarily inherent but is determined by the responding tissue. In this assay, varying 
amounts of DITHP are added to quiescent 3T3 cultured cells in the presence of [ 3 H]thymidine, a 
radioactive DNA precursor. DITHP for this assay can be obtained by recombinant means or from 

30 biochemical preparations. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured over 
an appropriate time interval, and the amount incorporated is directly proportional to the amount of 
newly synthesized DNA. A linear dose-response curve over at least a hundred-fold DITHP 
concentration range is indicative of growth factor activity. One unit of activity per milliliter is defined 
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as the concentration of DITHP producing a 50% response level, where 100% represents maximal 
incorporation of [ 3 H]thymidine into acid-precipitable DNA. 

Alternatively, an assay for cytokine activity of DITHP measures the proliferation of 
leukocytes. In this assay, the amount of tritiated thymidine incorporated into newly synthesized DNA 
5 is used to estimate proliferative activity. Varying amounts of DITHP are added to cultured 

leukocytes, such as granulocytes, monocytes, or lymphocytes, in the presence of [ 3 H]thymidine, a 
radioactive DNA precursor. DITHP for this assay can be obtained by recombinant means or from 
biochemical preparations. Incorporation of [ 3 H]thymidine into acid-precipitable DNA is measured 
over an appropriate time interval, and the amount incorporated is directly proportional to the amount 

10 of newly synthesized DNA. A linear dose-response curve over at least a hundred-fold DITHP 
concentration range is indicative of DITHP activity. One unit of activity per milliliter is 
conventionally defined as the concentration of DITHP producing a 50% response level, where 100% 
represents maximal incorporation of [ 3 H]thymidine into acid-precipitable DNA. 

An alternative assay for DITHP cytokine activity utilizes a Boyden micro chamber 

15 (Neuroprobe, Cabin John MD) to measure leukocyte chemotaxis (Vicari, supra) . In this assay, about 
10 5 migratory cells such as macrophages or monocytes are placed in cell culture media in the upper 
compartment of the chamber. Varying dilutions of DITHP are placed in the lower compartment The 
two compartments are separated by a 5 or 8 micron pore polycarbonate filter (Nucleopore, Pleasanton 
CA). After incubation at 37 °C for 80 to 120 minutes, the filters are fixed in methanol and stained 

2 o with appropriate labeling agents. Cells which migrate to the other side of the filter are counted using 

standard microscopy. The chemotactic index is calculated by dividing the number of migratory cells 
counted when DITHP is present in the lower compartment by the number of migratory cells counted 
when only media is present in the lower compartment. The chemotactic index is proportional to the 
activity of DITHP. 

25 Alternatively, cell lines or tissues transformed with a vector containing dithp can be assayed for 

DITHP activity by immunoblotting. Cells are denatured in SDS in the presence of P-mercaptoethanol, 
nucleic acids removed by ethanol precipitation, and proteins purified by acetone precipitation. Pellets 
are resuspended in 20 mM tris buffer at pH 7.5 and incubated with Protean G-Sepharose pre-coated 
with an antibody specific for DITHP. After washing, the Sepharose beads are boiled in electrophoresis 

3 o sample buffer, and the eluted proteins subjected to SDS-PAGE. The SDS-PAGE is transferred to a 

nitrocellulose membrane for immunoblotting, and the DITHP activity is assessed by visualizing and 
quantifying bands on the blot using the antibody specific for DITHP as the primary antibody and 125 I- 
labeled IgG specific for the primary antibody as the secondary antibody. 
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DITHP kinase activity is measured by phosphorylation of a protein substrate using y-labeled 
[ 32 P]-ATP and quantitation of the incorporated radioactivity using a radioisotope counter. DITHP is 
incubated with the protein substrate, [ 32 P]-ATP, and an appropriate kinase buffer. The [ 32 P] 
incorporated into the product is separated from free [ 32 P]-ATP by electrophoresis and the incorporated 
5 [ 32 P] is counted. The amount of [ 32 P] recovered is proportional to the kinase activity of DITHP in 
the assay. A determination of the specific amino acid residue phosphorylated is made by 
phosphoamino acid analysis of the hydrolyzed protein. 

In the alternative, DITHP activity is measured by the increase in cell proliferation resulting 
from transformation of a mammalian cell line such as COS7, HeLa or CHO with an eukaryotic 

10 expression vector encoding DITHP. Eukaryotic expression vectors are commercially available, and 
the techniques to introduce them into cells are well known to those skilled in the art The cells are 
incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 
expression of DITHP. Phase microscopy is then used to compare the mitotic index of transformed 
versus control cells. An increase in the mitotic index indicates DITHP activity. 

15 In a further alternative, an assay for DITHP signaling activity is based upon the ability of 

GPCR family proteins to modulate G protein-activated second messenger signal transduction 
pathways (e.g., cAMP; Gaudin, P. et al. (1998) J. Biol. Chem. 273:4990-4996). A plasmid encoding 
full length DITHP is transfected into a mammalian cell line (e.g., Chinese hamster ovary (CHO) or 
human embryonic kidney (HEK-293) cell lines) using methods well-known in the art. Transfected 

20 cells are grown in 12- well trays in culture medium for 48 hours, then the culture medium is discarded, 
and the attached cells are gently washed with PBS. The cells are then incubated in culture medium 
with or without ligand for 30 minutes, then the medium is removed and cells lysed by treatment with 
1 M perchloric acid. The cAMP levels in the lysate are measured by radioimmunoassay using 
methods well-known in the art. Changes in the levels of cAMP in the lysate from cells exposed to 

2 5 ligand compared to those without ligand are proportional to the amount of DITHP present in the 
transfected cells. 

Alternatively, an assay for DITHP protein phosphatase activity measures the hydrolysis of P- 
nitrophenyl phosphate (PNPP). DITHP is incubated together with PNPP in HEPES buffer pH 7.5, in 
the presence of 0. 1 % p-mercaptoethanol at 37 °C for 60 min. The reaction is stopped by the addition of 
30 6mloflON NaOH, and the increase in light absorbance of the reaction mixture at 410 nm resulting 
from the hydrolysis of PNPP is measured using a spectrophotometer. The increase in light absorbance 
is proportional to the phosphatase activity of DITHP in the assay (Diamond, R.H. et al (1994) Mol Cell 
Biol 14:3752-3762). 
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An alternative assay measures DITHP-mediated G-protein signaling activity by monitoring 
the mobilization of Ca** as an indicator of the signal transduction pathway stimulation. (See, e.g., 
Grynkievicz, G. et al. (1985) J. Biol. Chem. 260:3440; McColl, S. et al. (1993) J. Immunol. 
150:4550-4555; and Aussel, C. et al. (1988) J. Immunol. 140:215-220). The assay requires 
5 preloading neutrophils or T cells with a fluorescent dye such as FURA-2 or BCECF (Universal 
Imaging Corp, Westchester PA) whose emission characteristics are altered by Ca** binding. When 
the cells are exposed to one or more activating stimuli artificially (e.g., anti-CD3 antibody ligation of 
the T cell receptor) or physiologically (e.g., by allogeneic stimulation), Ca** flux takes place. This 
flux can be observed and quantified by assaying the cells in a fluorometer or fluorescent activated cell 

10 sorter. Measurements of Ca** flux are compared between cells in their normal state and those 

transfected with DITHP. Increased Ca^ mobilization attributable to increased DITHP concentration 
is proportional to DITHP activity. 

DITHP transport activity is assayed by measuring uptake of labeled substrates into Xenorms 
laevis oocytes. Oocytes at stages V and VI are injected with DITHP mRNA (10 ng per oocyte) and 

is incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 2 , ImM MgCl 2 , 
ImM Na^HPC^, 5 mM Hepes, 3.8 mM NaOH, 50fig/ml gentamycin, pH 7.8) to allow expression of 
DITHP protein. Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM KC1, 
ImM CaCl 2 , ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino 
acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. 

2 o radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 
minutes, uptake is terminated by washing the oocytes three times in Na + -free medium, measuring the 
incorporated label, and comparing with controls. DITHP transport activity is proportional to the level 
of internalized labeled substrate. 

DITHP transferase activity is demonstrated by a test for galactosyltransferase activity. This 

2 5 can be determined by measuring the transfer of radiolabeled galactose from UDP-galactose to a 

GlcNAc-terminated oligosaccharide chain (Kolbinger, F. et al. (1998) J. Biol. Chem. 273:58-65). The 
sample is incubated with 14 \il of assay stock solution (180 mM sodium cacodylate, pH 6.5, .1 mg/ml 
bovine serum albumin, 0.26 mM UDP-galactose, 2 \il of UDP-pH]galactose), 1 fxl of MnCl 2 (500 
mM), and 2.5 \i\ of GlcNAcpO-(CH 2 ) 8 -C0 2 Me (37 mg/ml in dimethyl sulfoxide) for 60 minutes at 

30 37 °C. The reaction is quenched by the addition of 1 ml of water and loaded on a C18 Sep-Pak 
cartridge (Waters), and the column is washed twice with 5 ml of water to remove unreacted UDP- 
[ 3 H]galactose. The [ 3 H]galactosylated GlcNAcpO-(CH 2 ) 8 -C0 2 Me remains bound to the column during 
the water washes and is eluted with 5 ml of methanol. Radioactivity in the eluted material is measured 
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by liquid scintillation counting and is proportional to galactosyltransferase activity in the starting 
sample. 

In the alternative, DITHP induction by heat or toxins may be demonstrated using primary 
cultures of human fibroblasts or human cell lines such as CCL-1 3, HEK293, or HEP G2 (ATCC). To 
5 heat induce DITHP expression, aliquots of cells are incubated at 42 °C for 15, 30, or 60 minutes. 
Control aliquots are incubated at 37 °C for the same time periods. To induce DITHP expression by 
toxins, aliquots of cells are treated with 100 uM arsenite or 20 mM azetidine-2-carboxylic acid for 0, 
3, 6, or 12 hours. After exposure to heat, arsenite, or the amino acid analogue, samples of the treated 
cells are harvested and cell lysates prepared for analysis by western blot. Cells are lysed in lysis 

10 buffer containing 1% Nonidet P-40, 0.15 M NaCl, 50 mM Tris-HCl, 5 mM EDTA, 2 mM 

N-ethylmaleimide, 2 mM phenylmethylsulfonyl fluoride, 1 mg/ml leupeptin, and 1 mg/ml pepstatin. 
Twenty micrograms of the cell lysate is separated on an 8% SDS-PAGE gel and transferred to a 
membrane. After blocking with 5% nonfat dry milk/phosphate-buffered saline for 1 h, the membrane 
is incubated overnight at 4°C or at room temperature for 2-4 hours with a 1 : 1000 dilution of 

1 5 anti-DITHP serum in 2% nonfat dry milk/phosphate-buffered saline. The membrane is then washed 
and incubated with a 1 : 1000 dilution of horseradish peroxidase-conjugated goat anti-rabbit IgG in 2% 
dry milk/phosphate-buffered saline. After washing with 0. 1 % Tween 20 in phosphate-buffered saline, 
the DITHP protein is detected and compared to controls using chemiluminescence. 

Alternatively, DITHP protease activity is measured by the hydrolysis of appropriate synthetic 

2 o peptide substrates conjugated with various chromogenic molecules in which the degree of hydrolysis 

is quantified by spectrophotometric (or fluorometric) absorption of the released chromophore 
(Beynon, R.J. and J.S. Bond (1994^ Proteolytic Enzymes: A Practical Approach. Oxford University 
Press, New York, NY, pp.25-55). Peptide substrates are designed according to the category of 
protease activity as endopeptidase (serine, cysteine, aspartic proteases, or metalloproteases), 
25 aminopeptidase (leucine aminopeptidase), or carboxypeptidase (carboxypeptidases A and B, 

procollagen C-proteinase). Commonly used chromogens are 2-naphthylaniine, 4-nitroaniline, and 
furylacrylic acid. Assays are performed at ambient temperature and contain an aliquot of the enzyme 
and the appropriate substrate in a suitable buffer. Reactions are carried out in an optical cuvette, and 
the increase/decrease in absorbance of the chromogen released during hydrolysis of the peptide 

3 o substrate is measured. The change in absorbance is proportional to the DITHP protease activity in the 

assay. 

In the alternative, an assay for DITHP protease activity takes advantage of fluorescence 
resonance energy transfer (FRET) that occurs when one donor and one acceptor fluorophore with an 
appropriate spectral overlap are in close proximity. A flexible peptide linker containing a cleavage 
3 5 site specific for PRTS is fused between a red-shifted variant (RSGFP4) and a blue variant (BFP5) of 
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Green Fluorescent Protein. This fusion protein has spectral properties that suggest energy transfer is 
occurring from BFP5 to RSGFP4. When the fusion protein is incubated with DITHP, the substrate is 
cleaved, and the two fluorescent proteins dissociate. This is accompanied by a marked decrease in 
energy transfer which is quantified by comparing the emission spectra before and after the addition of 
5 DITHP (Mitra, R.D. et al (1996) Gene 173: 13-17). This assay can also be performed in living cells. 
In this case the fluorescent substrate protein is expressed constitutively in cells and DITHP is 
introduced on an inducible vector so that FRET can be monitored in the presence and absence of 
DITHP (Sagot, L et al (1999) FEBS Lett. 447:53-57). 

A method to determine the nucleic acid binding activity of DITHP involves a polyacrylamide 

10 gel mobility-shift assay. In preparation for this assay, DITHP is expressed by transforming a 

mammalian cell line such as COS7, HeLa or CHO with a eukaryotic expression vector containing 
DITHP cDNA. The cells are incubated for 48-72 hours after transformation under conditions 
appropriate for the cell line to allow expression and accumulation of DITHP. Extracts containing 
solubilized proteins can be prepared from cells expressing DITHP by methods weD known in the art. 

is Portions of the extract containing DITHP are added to [ 32 P]-labeled RNA or DNA. Radioactive 
nucleic acid can be synthesized in vitro by techniques well known in the art. The mixtures are 
incubated at 25 °C in the presence of RNase- and DNase-inhibitors under buffered conditions for 5-10 
minutes. After incubation, the samples are analyzed by polyacrylamide gel electrophoresis followed 
by autoradiography. The presence of a band on the autoradiogram indicates the formation of a 

2 o complex between DITHP and the radioactive transcript A band of similar mobility will not be 

present in samples prepared using control extracts prepared from untransformed cells. 

In the alternative, a method to determine the methylase activity of a DITHP measures transfer 
of radiolabeled methyl groups between a donor substrate and an acceptor substrate. Reaction 
mixtures (50 pi final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl 2 , 10 mM dithiothreitol, 
25 3% polyvinylalcohol, 1.5 \xCi [methyl- 3 H}AdoMet (0.375 AdoMet) (DuPont-NEN), 0.6 ^g 

DITHP, and acceptor substrate (e.g., 0.4 ^ig [ 35 S]RNA, or 6-mercaptopurine (6-MP) to 1 mM final 
concentration). Reaction mixtures are incubated at 30 °C for 30 minutes, then 65 °C for 5 minutes. 
Analysis of [methyl- 3 H]RNA is as follows: 1) 50 nl of 2 x loading buffer (20 mM Tris-HCl, pH 7.6, 1 
M LiCl, 1 mM EDTA, 1% sodium dodecyl sulphate (SDS)) and 50 jlU oligo d(T)-cellulose (10 mg/ml 

3 0 in 1 x loading buffer) are added to the reaction mixture, and incubated at ambient temperature with 

shaking for 30 minutes. 2) Reaction mixtures are transferred to a 96- well filtration plate attached to a 
vacuum apparatus. 3) Each sample is washed sequentially with three 2.4 ml aliquots of 1 x oligo d(T) 
loading buffer containing 0.5% SDS, 0.1% SDS, or no SDS. and 4) RNA is eluted with 300 pi of 
water into a 96-well collection plate, transferred to scintillation vials containing liquid scintillant, and 
35 radioactivity determined. Analysis of [methyl 3 H]6-MP is as follows: 1) 500 pi 0.5 M borate buffer, 
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pH 10.0, and then 2.5 ml of 20% (v/v) isoamyl alcohol in toluene are added to the reaction mixtures. 
2) The samples mixed by vigorous vortexing for ten seconds. 3) After centrifugation at 700g for 10 
minutes, 1 .5 ml of the organic phase is transferred to scintillation vials containing 0.5 ml absolute 
ethanol and liquid scintillant, arid radioactivity determined, and 4) Results are corrected for the 
5 extraction of 6-MP into the organic phase (approximately 41 %). 

An assay for adhesion activity of DITHP measures the disruption of cytoskeletal filament 
networks upon overexpression of DITHP in cultured cell lines (Rezniczek, G.A. et ai. (1998) J. Cell 
Biol. 141 :209-225). cDNA encoding DITHP is subcloned into a mammalian expression vector that 
drives high levels of cDNA expression. This construct is transfected into cultured cells, such as rat 

10 kangaroo PtK2 or rat bladder carcinoma 804G cells. Actin filaments and intermediate filaments such 
as keratin and vimentin are visualized by immunofluorescence microscopy using antibodies and 
techniques well known in the art. The configuration and abundance of cytoskeletal filaments can be 
assessed and quantified using confocal imaging techniques. In particular, the bundling and collapse 
of cytoskeletal filament networks is indicative of DITHP adhesion activity. 

is Alternatively, an assay for DITHP activity measures the expression of DITHP on the cell 

surface. cDNA encoding DITHP is transfected into a non-leukocytic cell line. Cell surface proteins 
are labeled with biotin (de la Fuente, M.A. et al. (1997) Blood 90:2398-2405). Immunoprecipitations 
are performed using DITHP-specific antibodies, and immunoprecipitated samples are analyzed using 
SDS-PAGE and immunoblotting techniques. The ratio of labeled immunoprecipitant to unlabeled 

20 immunoprecipitant is proportional to the amount of DITHP expressed on the cell surface. 

Alternatively, an assay for DITHP activity measures the amount of cell aggregation induced 
by overexpression of DITHP. In this assay, cultured cells such as NIH3T3 are transfected with cDNA 
encoding DITHP contained within a suitable mammalian expression vector under control of a strong 
promoter. Cotransfection with cDNA encoding a fluorescent marker protein, such as Green 

2 5 Fluorescent Protein (CLONTECH), is useful for identifying stable transf ectants. The amount of cell 
agglutination, or clumping, associated with transfected cells is compared with that associated with 
untransfected cells. The amount of cell agglutination is a direct measure of DITHP activity. 

DITHP may recognize and precipitate antigen from serum. This activity can be measured by 
the quantitative precipitin reaction (Golub, E.S. et al. (1987) Immunology: A Synthesis . Sinauer 

30 Associates, Sunderland MA, pages 113-1 15). DITHP is isotopically labeled using methods known in 
the art. Various serum concentrations are added to constant amounts of labeled DITHP. DITHP- 
antigen complexes precipitate out of solution and are collected by centrifugation. The amount of 
precipitable DITHP-antigen complex is proportional to the amount of radioisotope detected in the 
precipitate. The amount of precipitable DITHP-antigen complex is plotted against the serum 
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concentration. For various serum concentrations, a characteristic precipitation curve is obtained, in 
which the amount of precipitable DITHP-antigen complex initially increases proportionately with 
increasing serum concentration, peaks at the equivalence point, and then decreases proportionately with 
further increases in serum concentration. Thus, the amount of precipitable DITHP-antigen complex is 
5 a measure of DITHP activity which is characterized by sensitivity to both limiting and excess quantities 
of antigen. 

A microtubule motility assay for DITHP measures motor protein activity. In this assay, 
recombinant DITHP is immobilized onto a glass slide or similar substrate. Taxol-stabilized bovine 
brain microtubules (commercially available) in a solution containing ATP and cytosolic extract are 
10 perfused onto the slide. Movement of microtubules as driven by DITHP motor activity can be 
visualized and quantified using video-enhanced light microscopy and image analysis techniques. 
DITHP motor protein activity is directly proportional to the frequency and velocity of microtubule 
movement. 

Alternatively, an assay for DITHP measures the formation of protein filaments in vitro . A 
1 5 solution of DITHP at a concentration greater than the "critical concentration" for polymer assembly is 
applied to carbon-coated grids. Appropriate nucleation sites may be supplied in the solution. The 
grids are negative stained with 0.7% (w/v) aqueous uranyl acetate and examined by electron 
microscopy. The appearance of filaments of approximately 25 nm (microtubules), 8 nm (actin), or 10 
nm (intermediate filaments) is a demonstration of protein activity. 

2 o DITHP electron transfer activity is demonstrated by oxidation or reduction of NADP. 

Substrates such as Asn-PGal, biocytidine, or ubiquinone- 10 may be used. The reaction mixture 
contains 1-2 mg/ml HORP, 15 mM substrate, and 2.4 mM NAD(P) + in 0.1 M phosphate buffer, pH 
7.1 (oxidation reaction), or 2.0 mM NAD(P)H, in 0.1 M Na^HPC^ buffer, pH 7.4 (reduction reaction); 
in a total volume of 0. 1 ml. FAD may be included with NAD, according to methods well known in the 
25 art. Changes in absorbance are measured using a recording spectrophotometer. The amount of 

NAD(P)H is stoichiometrically equivalent to the amount of substrate initially present, and the change in 
A^o is a direct measure of the amount of NAD(P)H produced; AA^ = 6620[NADH]. DITHP activity 
is proportional to the amount of NAD(P)H present in the assay. The increase in extinction coefficient 
of NAD(P)H coenzyme at 340 nm is a measure of oxidation activity, or the decrease in extinction 

3 o coefficient of NAD(P)H coenzyme at 340 nm is a measure of reduction activity (Dalziel, K. (1963) J. 

Biol. Chem. 238:2850-2858). 

DITHP transcription factor activity is measured by its ability to stimulate transcription of a 
reporter gene (Liu, H.Y. et al. (1997) EMBO J. 16:5289-5298). The assay entails the use of a well 
characterized reporter gene construct, LexA^-LacZ, that consists of LexA DNA transcriptional control 
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elements (LexA^) fused to sequences encoding the E. coli LacZ enzyme. The methods for constructing 
and expressing fusion genes, introducing them into cells, and measuring LacZ enzyme activity, are well 
known to those skilled in the art. Sequences encoding DITHP are cloned into a plasmid that directs the 
synthesis of a fusion protein, Lex A-DITHP, consisting of DITHP and a DNA binding domain derived 
from the LexA transcription factor. The resulting plasmid, encoding a LexA-DITHP fusion protein, is 
introduced into yeast cells along with a plasmid containing the LexA^-LacZ reporter gene. The amount 
of LacZ enzyme activity associated with LexA-DITHP transfected cells, relative to control cells, is 
proportional to the amount of transcription stimulated by the DITHP. 

Chromatin activity of DITHP is demonstrated by measuring sensitivity to DNase I (Dawson, 
B.A. et al. (1989) J. Biol. Chem. 264:12830-12837). Samples are treated with DNase I, followed by 
insertion of a cleavable biotinylated nucleotide analog, 5-[(N-biotinamido)hexanoamido-ethyl-l,3- 
thiopropionyl-3-aminoallyl]-2*-deoxyuridine 5' -triphosphate using nick-repair techniques well known 
to those skilled in the art Following purification and digestion with EcoRI restriction endonuclease, 
biotinylated sequences are affinity isolated by sequential binding to streptavidin and biotincellulose. 

Another specific assay demonstrates the ion conductance capacity of DITHP using an 
electrophysiological assay. DITHP is expressed by transforming a mammalian cell line such as 
COS7, HeLa or CHO with a eukaryotic expression vector encoding DITHP. Eukaryotic expression 
vectors are commercially available, and the techniques to introduce them into cells are well known to 
those skilled in the art. A small amount of a second plasmid, which expresses any one of a number of 
marker genes such as 0-galactosidase, is co-transformed into the cells in order to allow rapid 
identification of those cells which have taken up and expressed the foreign DNA. The cells are 
incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 
expression and accumulation of DITHP and J5-galactosidase. Transformed cells expressing P- 
galactosidase are stained blue when a suitable colorimetric substrate is added to the culture media 
under conditions that are well known in the art. Stained cells are tested for differences in membrane 
conductance due to various ions by electrophysiological techniques that are well known in the art. 
Untransformed cells, and/or cells transformed with either vector sequences alone or 0-galactosidase 
sequences alone, are used as controls and tested in parallel. The contribution of DITHP to cation or 
anion conductance can be shown by incubating the cells using antibodies specific for either DITHP. 
The respective antibodies will bind to the extracellular side of DITHP, thereby blocking the pore in 
the ion channel, and the associated conductance. 

XV, Functional Assays 
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DITHP function is assessed by expressing dithp at physiologically elevated levels in 
mammalian cell culture systems. cDNA is subcloned into a mammalian expression vector containing a 
strong promoter that drives high levels of cDN A expression. Vectors of choice include pCMV SPORT 
(Life Technologies) and pCR3. 1 (Invitrogen Corporation, Carlsbad C A), both of which contain the 
5 cytomegalovirus promoter. 5-10 ng of recombinant vector are transiently transfected into a human cell 
line, preferably of endothelial or hematopoietic origin, using either liposome formulations or 
electroporation. 1-2 |ug of an additional plasmid containing sequences encoding a marker protein are 
co-trans fected 

Expression of a marker protein provides a means to distinguish transfected cells from 
10 nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector. 

Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; CLONTECH), CD64, or a 
CD64-GFP fusion proteia Flow cytometry (FCM), an automated laser optics-based technique, is used 
to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of the 
cells and other cellular properties. 
15 FCM detects and quantifies the uptake of fluorescent molecules that diagnose events preceding 

or coincident with cell death. These events include changes in nuclear DN A content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 
light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 
20 proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 
composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry . Oxford, 
New York NY. 

The influence of DITHP on gene expression can be assessed using highly purified populations 
25 of cells transfected with sequences encoding DITHP and either CD64 or CD64-GFP. CD64 and 

CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Inc., Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art. 
30 Expression of mRNA encoding DITHP and other genes of interest can be analyzed by northern analysis 
or microarray techniques. 

XVI. Production of Antibodies 

175 



WO 01/21836 PCT/US00/25643 



DITHP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to . 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the DITHP amino acid sequence is analyzed using L ASERGENE software 
5 (DNASTAR) to determine regions of high immunogenicity, and a corresponding peptide is synthesized 
and used to raise antibodies by means known to those of skill in the art. Methods for selection of 
appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well described in 
the art. (See, e.g., Ausubel, 1995, supra . Chapter 11.) 

Typically, peptides 15 residues in length are synthesized using an ABI 431 A peptide 
10 synthesizer (PE Biosystems) using fmoc-chemistry and coupled to KLH (Sigma) by reaction with N- 
maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., 
Ausubel, supra .") Rabbits are immunized with the peptide-KLH complex in complete Freund's 
adjuvant. Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to 
plastic, blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio- 
15 iodinated goat anti-rabbit IgG. Antisera with antipeptide activity are tested for anti-DITHP activity 
using protocols well known in the art, including ELIS A, RIA, and immunoblotting. 

XVII. Purification of Naturally Occurring DITHP Using Specific Antibodies 

Naturally occurring or recombinant DITHP is substantially purified by immunoaffinity 
20 chromatography using antibodies specific for DITHP. An immunoaffinity column is constructed by 

covalently coupling anti-DITHP antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). Alto the coupling, the resin is 

blocked and washed according to the manufacturer's instructions. 

Media containing DITHP are passed over the immunoaffinity column, and the column is 

2 5 washed under conditions that allow the preferential absorbance of DITHP (e.g., high ionic strength 

buffers in the presence of detergent). The column is eluted undo* conditions that disrupt 
antibody/DITHP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and DITHP is collected. 

3 o XVIII* Identification of Molecules Which Interact with DITHP 

DITHP, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent 
(See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled DITHP, washed, and 
any wells with labeled DITHP complex are assayed. Data obtained using different concentrations of 
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DITHP are used to calculate values for the number, affinity, and association of D1THP with the 
candidate molecules. 

Alternatively, molecules interacting with DITHP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commercially 
5 available kits based on the two-hybrid system, such as the MATCHMAKER system (CLONTECH). 

DITHP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

10 

All publications and patents mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described method and system of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with specific preferred embodiments, it should 
is be understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the above-described modes for carrying out the invention which are 
obvious to those skilled in the field of molecular biology or related fields are intended to be within the 
scope of the following claims. 



177 



, -J - . _ 



WO 01/21836 



PCT/US00/25643 



c 
o 



o 
c 



UJ 
< 



E 

O S2 
*fc= XT 

CO *" 

E o 

O 4= 

O CO 

m | 

o 
o 

O CM 
- . CO 



CO 
CM 
CM 



00 



LO 



Ok 

o 

CD 
cz 

CD 
O 

CO 



o o 



LU O 

Eg 

fl 

1.1 J LJ 
O) 

O CO 

5 £ 
$ g 

§ g 

CD t 

2 8 

§~ 

o o 
o 2 

D LU 



Q 55 S 



o < 

CD Z 

> Q 

O O 

5 §> 

"E f 

CO € 



CO 



cz 
o 
E 

3 



o 



CD 

o 

CO 

D 

E 
P 



~* CO 

i I 

E g 
o E 
o o 

£ £. 

O- CD 

° 8 
O o 

CQ "D 

~ i 

< § 



-Q 

"cO 

8 
o 

CO 

o 

CZ 

o 



CD 

CO 

o 
a> 
o 
c 
o 

CD 

E 

o i 



% 
o 

$ 

<S CD 
CZ ^= 

o a 



a 

2 
o 



o 
3 
~o 
o 

a 

a> 
c 

CD CD 



o 
o 

LO 

o 



c 



a ~ 



0 
O 



O 
3 

a 

C 
CD 

2 
a 

CD co 

E ° 

§2 



o 
o 

ZD 
CO 

O 
ii — 

£ 

co 

D 

CD 

CO 



8 & 

™ lO 

Z co 

(D O 
C UJ 

o w 

si 

2 o 

r? co 



CO 



LU 



Q 
O 

CO 

C 
CD 

Si 

m Q 



CO 

C 

CD 

"a 

8 

o 

E 
o 
n= 

CM 

a> 

cz 

CD 

a> 
a> 

co ~r 

1 £ 
(J) Ca- 
ul CD 
^ O 

o 2 

8£ 

^ o 

> — 

2 8 

CD 5 

"to CD 
D C 



W GO 

co 



5 

CM 
i 

LU 



o o 

i i 

LU LU 

O O 

o — 



CO 

3 

o 

CD 

CD 

o 
cz 

co 

z> 
4= 

g 

cz 

2 



1 -8 Q E 



cz 

I"? 

CD > 

1 £ 

O D 

E £ 

2 o 
o O 



CO 

C 

o 

CL 
D 

CO 

O 

E 
o 

□c 

CM 

c 

Z3 

_Q 

ZJ 
co 

o 

D 

Z5 

a> 

CD 



CD 

CO 

a 
a 

CL 

CO 

o 

CD 



c 

Z3 
JD 
Z3 

CO 

O 



D 

D 

O 

<D 

CO 

O 



CD 

CO 

O 

CZ 



o ^ 

£1 CO 

Q. CD 



CL 

CO 

O 
JZ 
Q. 

si 

CO -4— 

a 2 

CD q. 

O a> 

CD p 
co CZ 

° s 

cr 2 
O c: 

a $ 



Q 
O 

CO 



CO 



co 

Z3 
O 

CO 

3 

E 

CO 
Z3 



co 

z> 
o 

CO 

ZJ 



CO 

3 



r: ^ ^ 

CD O) ^ 
C co O 
"co 3 ^ 

2 D w 

II 8 



o 

CL D 

C CM 

sf= O 

TT cm 

O LL_ 

C Z 



CO 



c c c 
O © "© 
O O O 

a a a 

O 5 CD 
0)0 0) 

c c c 

q= v= 

O O O u- 
C C C £J 
-R3 T3 O 



o 
o 

CD 
CZ 
Li_ 

O 



"CD 



tS 2 

« 2- 

co CD 

I ^ 

CO 

CM 7" 

CM Q 

— > C 



CM O 
CO 



^<>§iOCslg^cOcOOCOCog^^r-iOCN 
UJ UJ , , UJ LU * V UJ LU LU UJ LU LU LU LL _ 

_ uj uj 0 0 0 0 0 UJ uj ^ 0 0 0 0 



co 



i^. CM 



CM — ^ O CM 



/ CO - O N 



CD 

n 
E 

3 

z: 
0 

Q 
<D 
D 

a 
£ 

.CD 



LO 
CO 

co 
CD 



8 



co 
to 




<0 CO N OO 

r^. "3 o o o 

CM O N ^ ^> 

— tO CM O CO 

O O CM CO CM 

CM O O ^ o 

I s * CN I — CN N 

o> a> a> o> D) 




(>Ch<)r^CM^a<)C>o2£cMCMSf:CM 

9? rz Q g^^ op p V, ^ Qr ^ ^ <> 



OOCOCOCN2^<>iOcoSR§cOCM2g 
DQ)DO)a) a, 0)0)D 0)D) D)0) a)D) 




co co 
CM CM 



o o o a o o 

CD CD CD CD CD CD 



^ "D "D "D "D "D "D 



O CO ^ N r- o 

co ^ lO iO — O 

CM O CO O O CM 

o o* co o C> 

— 6- ^ o o o* 

O CO CO O O 



lO CO 

co 
O 

ss 

co CO 
LO CM 
D> CD 



O O 
CD CD 
"D U 



<5 CO 
co 

CM O 

3 



6 



Q — 

a 

UJ 

CO 



CM CO 



io o n co 



/vO«-CMc0^iOONCOOp^cMc0^iOOrs 



178 



WO 01/21836 



PCT/US00/25643 



CD 
< 



c 
o 



o 
c 
c 

< 




IU .» LU HI LU .'. Ill III III 



CO 



O O g 



CM O 
co co 

I I 



CO 

i 



o 



LU UU LU .». ' LU UJ UJ n LU Lil n LU 



Q o 



O 



OO 



© 
-Q 

E 
z? 

o 

Q 

.£ 

D 

a 

E 
.© 



O 



£ 3 5 « ~ £ 

CD « — co 



o o 

r*. o co 



CO N CO 

-sr 



CM O ' — 

GO CO CM 



■ — • — Tkvi n^N cn co c&T^T^vi^r^oco 



o 

© 
"D 



a 

<D 
"D 



O 
(D 
T3 
CM 



O 
CD 
T3 



O O O O 
© © © © 
"D T3 "D T3 



© 
<> 



O O 
© © 
"O ~0 



O 
© 



© 




o o o 

© © © 

"0X5-0 

^ CM CO 

(> lO 

— CO CM 

CO N lO 

CM CO O 

O CO O 

CO CM CO 



co 

o o 

LU LU 

§ co 

l< CM 



CO iO 

co 

o 
iO r*-» 
O co 

■•— CO 



O O 

© © 

<5 to 

lo oo 

O CM 

s ° 

cm 



O CM 

o g g o 
o o 

CM 



O lO iO O co 
lO "ST CO CM O 



1 1 1 1 1 1 i n 1 1 1 1 1 1 

82888 

CM ^dF <> CO — 



- S cm g g jO 
< g ) 0> r ^ ) 0)° )a) 0)0) r o ) 

O jj> CD © O 

O T3 -g -g O 

— : o *sf 

E 2 R g S 

^ Q c> lo °? 

lo <5 ^ 



o 
o 



o o o 
© © © 

"D "O "D 
<) CM CM CM 

00 fM 

3 5 S * 

CO <- n ^ 



=5f * « 



Q 

a 

UJ 

CO 



cOO*O^CMcO'^tcO<>r^COC>0*--CMcO 
CMCMcOCOCOcOcOcOCOCOCOCO-^TtTT^r 



179 



•^r lo 



-or^oooo — cmco 



s 



WO 01/21836 



PCT/US00/25643 



c 
o 



o 
c 
a 
< 



CO 

< 



CO 

D 

CO 

c 

D 

1 — 

t 

CD 
C 



< 

E 



CO 

3 



_CD _ 
Q- CL 

O O 
to 

O 

E 



CO 

< 
CO 



O 
.c 

Q. 
O 
c 
0 
o 

Q_ 

CD 
C 

t> 

c 



O ~F\ 



c 

2 
a 

o 

<D 



E 

£ 
o 

CO 

3 2 

3 CL 
O o) 

3 c c 
E "o © 

CO ~ "f^ 

2< g- 

O 05 CL 

a c ^ 
_ <*= < 

Q Q ? 
^15 
2: ex. < 



CO 

c 

CD 

s 

o 
£ 



o 

D 
0 

a 



C 

2 

o_ 
o 

o 
z> 
a 

£ 

_ D 0 

^ 8 I 

CD O ~g 
O CD i 

<D -L 

o 2 
2 o> 

< -c o 



3 



CD CD 



O 
CO 
- Q_ 

si 



to 

< 

uj 
u_ 

co 



O 

Z < 



1 



CO 



co 



2 

Q_ 
O 



3 

C 

< 



06 ^o* 

^ a £o 

CO = /""\ 

o t m 

x D 111 

0) ^- 7 

CO 

CD C O 

c o ^ 

« I z 

to Q. ^ 

£ Ro 



^= CM IC g5 «J ^ 

== a> *— — !Z 



2 8 o o s o 

q CO °P °P 8 <3 - 
b? CM • — ,_J CO 



O UJ O 

o 

CM 
CO 



CO 
CO 



n 00 w to 

C3 CM Cp CO 

1X1 UU UJ UJ LLI 

• — - *~~ •" — cO cO « — - 



£ a 

E 
o 
o 

^_ x 

E l 

O 
O 

c 
a> 

c 2 
CD Q- 

2 3 

-a -g 
£ o 

D g? 

O O 

O i 

% P 

9 a 

is 
si 



CO 

C 

a 
o 

to 

o 

E 
o 



CO jz 

§i 

x £ 

CM 



a> a 

o o 

CO 

2 0 

O CO 

O CD 

C Q_ 

0 CL 
.00 3 

> ^ 

1 § 
O -2 



°? lL ^ 

(Dog 
CM CO ^ 



CD 

E 
3 



CD 

Q 
0) 
D 
CL 

E 



d 



CO 

o 



CO O co lO OJ 

cMcor^<5io^:^r^S2 

ipiiiiiii 

^ co co 10 co co 7^ >o 



c> 
o 

CM 
O 



N N OO lO U) CO 
O « — O N O 
O CM CO ^ cO O 

O co 
^ <D O co CO 

CO r— IO 



^ co co 10 00 co ^7^0 ifi co <t t8 



8 



cd £ a> 

<i 3 «i 

Q ^ co 
O S CM 

r-. o 

sis 




Q 

LU 

CO 



SO r-. 
to m 



CO <> Q - — CM CO 

to 10 5 o o o ^5 



lq o co o o — 
•O o <5 o o 

180 



WO 01/21836 



PCT/US00/25643 



<D 

D 

> 



O £ CO 
CO ^ O 



rv. o «— 
o co m 



CM 



CM — 
LO — 



LU LU LU 



oOLOCMtO-— co*^r^joooo 
r^oor— csir— o^rooooo 
.} ...» . » . * » * 



UJLUUJUJLiJLiJiiJLUUJIiJLllUJLlJ 



Q5000000000000000000 
co^cqO;piqiqcqoaq^oo^oq<)N<)' — cm 



o o 
<? 9 

LU 



CM LO CO 

7 9 9 



N lO N 
CO CO O 



co o 



o 
co 



UJ LLI LU LU LU 



OOOOOOOOOOOO 
•OCMLOOOcOOOO^^Or^OO 



CM 

y 

GO 
< 



C 

o 



S I 

"o o> 

f I « 

Q -c Q- 

CD 

g cm -r, 

?F ZJ 

CD c 
Q_ E 
co ^ 

§ 5 

co CD 

3 E 

S 1 

co 5 

8 £ 
£ o 

CL 
CL 

x o 1 

? °l * 

£ o — 1 

£ T 5 

o 



CD 

E 

& 
c 

CD 

-#— co 

C CD 

CD <2 

?E 

CD 

Q_ CD 
CD 

CL P 

— 1 ^" 
CL -C 

E x> 

P 

o o 
> CD 



CL 

I- 1 (D 
O 



CD 

CO 

o 
o 

CD 



8 
o 

o 

JZ 
CD 



to 

CM 



O 
CM 



"E 
S 

CD <g 

(D CD 
h= CL 



o 

CD 

o v 

2 o 
o $ 

O Of 

L. ■ ■ 



E 
g 

a 

■8 c 
£ g 

o 

o xj 
?> 8 

CD 35 
CD c 

£ -2 

E * 
E o 



o 

CM 

a> 2 

to I 
D CD 

P ° 

a 
g. 



•o 

U. Li- ^ 



b 
E 



» ^ o 



-*= 13 CM 
M1J O 



i 
o 

c 
o 

£ 
o 
■o 

a) 
c 

o o 

CD ~® 
O CD 

O C 

1? o 

CD Q_ 

o 9 
6 5 

-c z> 

CL LU 



co 

a 
c 

a 



3 



CD 

co 

O 



sj- 

O 

a 

c c 

CD O 

o E 
^% 

€1 

CD 9 

CO CO 



T3 
C 

I 



CD 

CO 

Q 



CD 

CO 



< 



c 
D 

E 



a o 

-Q 



CD CD CD CD CD CD <D CD CD CD CD 

aaaaaaa a a q_ q. 

^^^^^•^^ ^ ^ 

CM CM CM CM CM CM CM CM CM CM CM 

I I I I I I I XX X X 

CM CM CM CM CM CM CM CM CM CM CM 

OOOOOOO OO O O 

C^D>0)D)C3>D)D>0 O) O) Q O U) Q D) O 
OOOGOOO<00<<0<0< 



CM CM CM CM CM CM CM CM CM CM CM 

XXXXXXX0QXX0OCOXCOXCO 

cmcmcmcmcmcmcm<cmcm<r<ccm<cm< 
OOOOOOoSoOl*l*O^Ol* 

N N Tj N N N N N N M Rj 



CO 



® 
E 

D 



r-W(OCOCNJO^CSI'-'-rOJOJC\lr-CN|r-^CS(COCOC>|COCO<OW 

00000000000000000000000000000000 



P co 



CO 



LO 



nF^SSLOLO^r2^^^Lo^^co^°^o^rrc>o 

^CN^^;^^^.— CSl^CM~CvlC>^COCM^iO(>J 



CD © 



o o 

CD CD 
"O "O 



a O. CM CM 

Q. co co LQ 
C 

^ O ^ ^ 

® S co O 
K ^ CO CO 




o 


O 


O 


o 


,dec 


o o 


o 


CD 


CD 


CD 


CD 


CD CD 


CD 


T5 


"O 


TJ 


"O 


~q "o 


■q 


CO 


CO 




Ch 


o 






LO 


LO 




r< 


242082. 


c> c> 


co 






s 


O 


CO co 






LO 


o 


CM 

CO 








5 


o 


CM 


o o 


c> 

OO 



o 

CD 
"O 



O O 
<D CD 
"O U 



o 

CD 



o 
o 



T3 *D 



O 
CD 
"O 



O O 
CD CD 



"O 



O 
<D 
"O 



O 
CD 



O 
CD 



"O "O 



'"^'r^^'^oooocoO'OO 

LO LO LO ' — > — O CO N N ^ ^ U) 

LOLOOOOCMCMOOcOCOr^ 



co oo o o 



CM 



o o — 



QojcocoiONNcoaS 

LU 
GO 



^l0l0U)OC0OOO«-CNWc0^r^i0ONNC0C0C> 



181 




WO 01/21836 PCT/US00/25643 
/D^OOr-r^OO^OOr^OeoO 

gCp-r-^Or-OOr-r-COCM 

LU LU LLI LU LU QJ yj i±j LU LU LU LU 

^oopooooooooo 



OO 




CO 


>o 


O 


CO 






3 

• 


o 


g 


O 




O 


T CO 


2 


LO 




OJ 


9 


9 


o 
■ 


O 
i 




9 


o 
i 


1 




1 


9 


o 
i 


CN 
i 




i 






LU 


LU 


LU 


LU 


i 


LU 


LU 


LU 


UJ 


LU 


LU 


LU 


LU 


LU 




LU 


LU 


ULI 




O 


O 


O 


O 


LU 

8 


o 


o 


O 


o 


o 


O 


o 


o 






O 


o 


O 


8 






CM 


lO 






O 


CO 




CN 


to 


o 






CM 


CM 




<> 


l< 


CO 


<> 






id 


CO 


<> 


<> 


l< 


CM 




co" cm 






r< 





c 

£ 
o 



CM 

CD 
< 




Q-S^OOcOi— ^OlOO 

O§ioocotou5co- 



CO 



LO 



co -o o 
O CO <> 



o 

CO 



CM 

o 



CO 

CO 
CM 



o 



^Sowocopjg^^OLOioS 



^J^C>r^Lpco^r^.— r— o 
P — "srocor^o^rcor^ 
55J2cocM^r^cMoo>oco^r 



« — CO LO ■ — r— CO 

t^- • — co co co 
i— — co cm 



O 



co £ 
o 



o co o lo 



182 



co 

CM 



^ ^ r> ^ ^ 

CM CM °^ CO 



^ ^ s 

CM CM ■ — 



CD 



Q 
O 

LU 

CO 



8 


O 


o 


o 


o 


o 


O 


O 




<D 


CD 


CD 


<D 


CD 


CD 


<D 




"D 


■q 


"D 


"D 


"D 


"O 


"D 




CM 




CM 


CM 


co 


co 


co 


O 






O 


O 








LO 






CO 


GO 


<> 


t> 


C> 




§ 




O 
















O 


o 


CM 


CM 


CM 


CO 


8 




8 




8 


8 


O 
O 


o 


o 




CM 


CM 


CO 


CO 


co 


CM 


CO 


CO 


CO 


CO 


CO 


CO 


CO 



o o 

CD <D 
"O u 



CM CM 
CO CO 

o o 



o o 

CD CD 

CM CM 
CO CO 
O O 

S3 

CM CM 



o o o o o 

0) CD CD CD CD 

t< <> CM CM 

S c> 3 c> 

^ O <T i— CO 

CM LQ co co 

2 V O CN CO 

3 ^ co o co 

^ CO ^ CO CM 



o o 

CD CD 

"O "O 

CO co 

id lo 

CM CM 

LO LO 

CO CO 



o o 

CD CD 

co cd 

CO cd 

CM CM 

O O 

<> nO 

o o 



o o o 

CD CD CD 

"D "D T3 

CO CO cq 

-d cd cd 

« — co co 

o o o 

S 3 3 

^ CM CM 




^ to m n 



CO O — CM CO CO 

co co 



iQ^^^^^aooco^^fN 

^lOlOlOlOlOlolOlOO-OOO 



182 



WO 01/21836 



TABLE 3 



ID NO: 


Template ID 


Start 


Stop 


1 


405310.1. oct 


1091 


1159 




405310.1.oct 


1027 


1089 


-| 


4053 10.1. oct 


2079 


2150 


! 


4053 10.1. oct 


1217 


1276 




4053 10.1. oct 


1094 


1165 




405310.1 oct 


2062 


2151 


1 


405310.1 oct 


2919 


2972 


-, 


405310 1 oct 


2732 


2788 


1 


405310 l.oct 


1076 


1135 


1 


4053 10. l.oct 


2246 


2302 


•j 


4053 10. l.oct 


1419 


1487 


1 


4053 10. l.oct 


2952 


3020 


1 


4053 10. l.oct 


2086 


2154 


1 


4053 10. l.oct 


2934 


2993 


1 


4053 10. l.oct 


2940 


3020 


1 


4053 10. l.oct 


887 


955 




4053 10. l.oct 


2934 


2996 




40531 0.1. oct 


1091 


1159 




4053 10. l.oct 


1027 


1089 




4053 10. l.oct 


2079 


2150 




4053 10. l.oct 


1217 


1276 


! 


4053 10. l.oct 


1094 


1165 




405310.1.oct 


2062 


2151 




4053 10. l.oct 


2919 


2972 




4053 10. l.oct 


2732 


2788 




40531 0.1. oct 


1076 


1135 


1 


4053 10. l.oct 


2246 


2302 




4053 10. l.oct 


1419 


1487 


1 


4053 10. l.oct 


2952 


3020 


1 


4053 10. l.oct 


2086 


2154 


! 


4053 10. l.oct 


2934 


2993 


] 


4053 10. l.oct 


2940 


3020 


! 


4053 10. l.oct 


887 


955 


! 


4053 10. l.oct 


2934 


2996 


3 


33475 1.2.dec 


1476 


1532 


3 


334751 .2.dec 


675 


731 


3 


334751 .2.dec 


1532 


1606 


3 


334751 .2.dec 


2 


85 


3 


334751 .2.dec 


795 


857 


3 


334751 .2.dec 


675 


737 


3 


334751 .2.dec 


1625 


1681 


3 


334751. 2.dec 


783 


857 


4 


237330.8.dec 


683 


730 


5 


053778.11. dec 


1627 


1686 


5 


053778.11. dec 


1306 


1374 


5 


053778.11. dec 


1594 


1686 


5 


053778.11. dec 


1279 


1374 


6 


360645. lO.dec 


385 


453 


6 


360645.10.doc 


415 


495 


7 


334808.1. dec 


529 


609 



183 



PCT/US00/25643 



Frame 


Domain Type 

9 1 


forward 2 


SP 


forward 1 


SP 


forward 3 


TM 


forward 2 


SP 


forward 2 


TM 


forward 1 


SP 


forward 3 


TM 


forward 2 


TM 


forward 2 


TM 


forward 2 


TM 


forward 3 


SP 


forward 3 


TM 


forward 1 


SP 


forward 3 


TM 


forward 3 


TM 


forward 2 


SP 


forward 3 


TM 


forward 2 


SP 


forward 1 


SP 


forward 3 


TM 


forward 2 


SP 


forward 2 


TM 


forward 1 


SP 


forward 3 


TM 


forward 2 


TM 


forward 2 


TM 


forward 2 


TM 


forward 3 


SP 


forward 3 


TM 


forward 1 


SP 


forward 3 


TM 


forward 3 


TM 


forward 2 


SP 


forward 3 


TM 


forward 3 


TM 


forward 3 


SP 


forward 2 


SP 


forward 2 


SP 


forward 3 


SP 


forward 3 


SP 


forward 2 


TM 


forward 3 


SP 


forward 2 


SP 


forward 1 


SP 


forward 1 


SP 


forward 1 


SP 


forward 1 


SP 


forward 1 


SP 


forward 1 


SP 


forward 1 


SP 



WO 01/21836 PCTYUS00/25643 



TABLE 3 



SEQ ID NO: 


Template ID 


Start 


Stop 


Frame 


Domain Type 


7 


334808.1. dec 


1734 


1817 


forward 3 


SP 


7 


334808..1 .dec 


56 


106 


forward 2 


SP 


7 


334808.1. dec 


56 


112 


forward 2 


SP 


7 


334808.1. dec 


56 


130 


forward 2 


SP 


7 


334808.1. dec 


56 


124 


forward 2 


SP 


7 


334808.1. dec 


56 


118 


forward 2 


SP 


10 


232851. 7.dec 


987 


1052 


forward 3 


TM 


10 


232851 .7.dec 


993 


1043 


forward 3 


TM 


10 


232851 .7.dec 


987 


1043 


forward 3 


TM 


11 


083804.1. dec 


841 


897 


forward 1 


TM 


11 


083804.1. dec 


565 


630 


forward 1 


SP 


11 


083804.1. dec 


706 


786 


forward 1 


TM 


11 


083804.1. dec 


235 


303 


forward 1 


TM 


11 


083804.1. dec 


829 


882 


forward 1 


TM 


11 


083804.1. dec 


218 


274 


forward 2 


SP 


11 


083804.1. dec 


244 


303 


forward 1 


TM 


11 


083804.1. dec 


968 


1039 


forward 2 


SP 


11 


083804.1. dec 


439 


522 


forward 1 


TM 


11 


083804.1. dec 


565 


648 


forward 1 


SP 


11 


083804.1. dec 


829 


900 


forward 1 


TM 


11 


083804.1. dec 


197 


289 


forward 2 


SP 


11 


083804.1. dec 


218 


298 


forward 2 


SP 


11 


083804.1. dec 


238 


291 


forward 1 


TM 


11 


083804.1. dec 


218 


277 


forward 2 


SP 


11 


083804.1. dec 


730 


789 


forward 1 


TM 


11 


083804.1. dec 


247 


309 


forward 1 


TM 


11 


083804.1. dec 


959 


1045 


forward 2 


SP 


12 


272721 Aoct 


3078 


3149 


forward 3 


SP 


12 


272721 Aoct 


936 


992 


forward 3 


TM 


12 


272721 Aoct 


1027 


1086 


forward 1 


TM 


13 


461603.4.oct 


2291 


2359 


forward 2 


TM 


13 


461603.4.oct 


2309 


2356 


forward 2 


TM 


14 


332465.2.dec 


2826 


2882 


forward 3 


SP 


14 


332465.2.dec 


2657 


2719 


forward 2 


TM 


19 


242082. lO.dec 


1840 


1923 


forward 1 


SP 


20 


01 9239.1. dec 


2043 


2123 


forward 3 


TM 


21 


899943.1. dec 


3543 


3617 


forward 3 


SP 


21 


899943.1. dec 


3530 


3610 


forward 2 


SP 


21 


899943.1. dec 


2824 


2877 


forward 1 


TM 


21 


899943.1. dec 


3321 


3380 


forward 3 


SP 


21 


899943.1. dec 


3543 


3602 


forward 3 


SP 


21 


899943.1. dec 


2836 


2892 


forward 1 


TM 


27 


901 978.1. dec 


1250 


1309 


forward 2 


SP 


28 


479346.1. dec 


467 


553 


forward 2 


SP 


29 


481 750.1. dec 


664 


717 


forward 1 


SP 


32 


900680.2.dec 


27 


77 


forward 3 


TM 


34 


053826.1. dec 


179 


253 


forward 2 


TM 


34 


053826.1. dec 


1354 


1416 


forward 1 


SP 


34 


053826.1. dec 


1354 


1416 


forward 1 


TM 


34 


053826.1. dec 


1351 


1401 


forward 1 


TM 
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TABLE 3 



SEQ ID NO: 


Template ID 


Start 


Stop 


Frame 


Domain Type 


34 


053826.1. dec 


485 


556 


forward 2 


SP 


34 


053826.1. dec 


485 


547 


forward 2 


SP 


34 


053826.1. dec 


1250 


1324 


forward 2 


TM 


34 


053826.1. dec 


194 


271 


forward 2 


TM 


34 


053826.1. dec 


485 


541 


forward 2 


SP 


34 


053826.1. dec 


173 


226 


forward 2 


TM 


34 


053826.1. dec 


485 


550 


forward 2 


SP 


34 


053826.1. dec 


1363 


1422 


forward 1 


TM 


34 


053826.1. dec 


194 


253 


forward 2 


TM 


36 


400607. 19.dec 


1192 


1248 


forward 1 


TM 


36 


400607.1 9.dec 


1198 


1248 


forward 1 


TM 


36 


400607. 19.dec 


1174 


1245 


forward 1 


TM 


37 


444248.7.dec 


36 


98 


forward 3 


SP 


37 


444248.7.dec 


36 


83 


forward 3 


SP 


37 


444248.7.dec 


15 


92 


forward 3 


SP 


37 


444248.7.dec 


15 


92 


forward 3 


SP 


37 


444248.7.dec 


36 


92 


forward 3 


SP 


38 


346599.9.dec 


122 


181 


forward 2 


SP 


38 


346599.9.dec 


128 


172 


forward 2 


SP 


38 


346599.9.dec 


128 


196 


forward 2 


SP 


38 


346599.9.dec 


128 


181 


forward 2 


SP 


39 


480344.2.dec 


1105 


1158 


forward 1 


TM 


39 


480344.2.dec 


122 


223 


forward 2 


SP 


40 


411396.24.dec 


1347 


1403 


forward 3 


SP 


40 


411396.24.dec 


1347 


1412 


forward 3 


SP 


40 


411396.24.dec 


1347 


1409 


forward 3 


SP 


41 


30281 9.4.dec 


2612 


2680 


forward 2 


SP 


42 


238734.2.dec 


1532 


1597 


forward 2 


SP 


43 


399525.3.dec 


1235 


1285 


forward 2 


SP 


45 


410628.5.dec 


1388 


1438 


forward 2 


TM 


45 


410628.5.dec 


209 


274 


forward 2 


SP 


45 


410628.5.dec 


235 


318 


forward 1 


SP 


46 


053649.6.dec 


6042 


6116 


forward 3 


TM 


46 


053649.6.dec 


4699 


4755 


forward 1 


TM 


46 


053649.6.dec 


4025 


4105 


forward 2 


TM 


46 


053649.6.dec 


4684 


4734 


forward 1 


TM 


46 


053649.6.dec 


6676 


6726 


forward 1 


TM 


46 


053649.6.dec 


4702 


4776 


forward 1 


TM 


46 


053649.6.dec 


6688 


6735 


forward 1 


SP 


46 


053649.6.dec 


3060 


3122 


forward 3 


TM 


46 


053649.6.dec 


6088 


6138 


forward 1 


TM 


47 


221914.2.dec 


430 


483 


forward 1 


TM 


47 


221914.2.dec 


932 


979 


forward 2 


SP 


47 


221914.2.dec 


544 


597 


forward 1 


SP 


47 


221914.2.dec 


430 


501 


forward 1 


SP 


47 


221914.2.dec 


544 


603 


forward 1 


TM 


47 


221914.2.dec 


442 


495 


forward 1 


SP 


47 


221914.2.dec 


430 


495 


forward 1 


SP 


48 


347748.2.dec 


345 


389 


forward 3 


SP 


48 


347748.2.dec 


315 


389 


forward 3 


SP 
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TABLE 3 



SEQ ID NO: 


Template ID 


Start 


Stop 


Frame 


Domain Type 


48 


347748 2.dec 


333 


389 


forward 3 


SP 


48 


347748 2 dec 


351 


410 


forward 3 


TM 


48 


347748 2 dec 


324 


389 


forward 3 


SP 


48 


347748 2 dec 


330 


398 


forward 3 


TM 


48 


347748 2 dec 


327 


389 


forward 3 


SP 


48 


347748 2 dec 


342 


392 


forward 3 


TM 


48 


347748 2 dec 


303 


389 


forward 3 


SP 


48 


347748 2 dec 


333 


395 


forward 3 

■ 1 WW V_p4 P \_4 


TM 


51 


41 1408 20 dec 


372 


422 


forward 3 


SP 


51 


41 1408 20 dec 


661 


732 


forward 1 


SP 


51 


41 1408.20.dec 


369 


434 


forward 3 


SP 


51 


41 1408.20.dec 


318 


428 


forward 3 


SP 


51 


41 1408 20 dec 


369 


428 


forward 3 


SP 


51 


41 1408.20.dec 


369 


422 


forward 3 


SP 


52 


035973.1. dec 


138 


224 


forward 3 


SP 


54 


387807. 4. oct 


469 


543 


forward 1 


SP 


54 


387807.4.oct 


346 


396 


forward 1 


SP 


54 


387807.4.oct 


346 


402 


forward 1 


SP 


54 


387807. 4.oct 


56 


148 


forward 2 


SP 


55 


406790.3.dec 


792 


854 


forward 3 


TM 


57 


196623.3.dec 


737 


808 


forward 2 


TM 


59 


264633.8.dec 


5524 


5583 


forward 1 


SP 


59 


264633.8.dec 


5209 


5277 


forward 1 


TM 


59 


264633.8.dec 


5195 


5248 


forward 2 


TM 


59 


264633.8.dec 


4877 


4939 


forward 2 


TM 


59 


264633.8.dec 


5178 


5249 


forward 3 


TM 


59 


264633.8-dec 


7156 


7218 


forward 1 


SP 


59 


264633.8.dec 


5203 


5280 


forward 1 


TM 


59 


264633.8.dec 


6597 


6647 


forward 3 


TM 


59 


264633.8.dec 


703 


759 


forward 1 


TM 


59 

Vp* * 


264633.8.dec 


5186 


5260 


forward 2 


TM 


59 


264633.8.dec 


5544 


5618 


forward 3 


SP 


59 


264633.8.dec 


5204 


5263 


forward 2 


TM 


59 


264633.8.dec 

■ ^p* v^p* 7 ^^ • ™» ■ ^p^ ^p" 


5218 


5277 


forward 1 


TM 


59 


264633.8.dec 


4877 


4933 


forward 2 


TM 


60 


337822.4.dec 


1461 


1532 


forward 3 


SP 


60 


337822.4.dec 


209 


286 


forward 2 


TM 


60 


337822.4.dec 


210 


281 


forward 3 


TM 


62 


256009.2.dec 


6728 


6790 


forward 2 


SP 


62 


256009.2.dec 


1974 


2033 


forward 3 


SP 


62 


256009 2 dec 


1317 


1409 


forward 3 


SP 


62 


256009.2.dec 


3693 


3746 


forward 3 


TM 


63 


231892.12.dec 


1009 


1071 


forward 1 


TM 


63 


231892.12.dec 


467 


556 


forward 2 


SP 


63 


231892.12.dec 


985 


1056 


forward 1 


TM 


64 


197445.1. oct 


2155 


2217 


forward 1 


TM 


64 


197445.1. oct 


2146 


2196 


forward 1 


TM 


64 


197445.1. Oct 


2380 


2436 


forward 1 


TM 


64 


197445.1. oct 


2158 


2214 


forward 1 


TM 


64 


197445.1. oct 


2155 


2217 


forward 1 


TM 
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TABLE 3 



SEQ ID NO: 


TemDlate ID 


Start 


Stop 


Frame 


Domain Type 


64 


197445 1 oct 


2146 


2196 


forward 1 


TM 


64 


197445 1 oct 


2380 


2436 


forward 1 


TM 


64 


197445 1 net 


2158 


2214 


forward 1 


TM 


65 


348775 1 oct 


232 


288 


forward 1 


TM 

■ 1 w 1 


65 


348775 1 oct 


937 


999 


forward 1 


TM 


65 


348775 1 oct 


253 


306 


forward 1 


SP 


65 


348775 1 oct 


1523 


1582 


forward 2 


TM 


65 


348775 1 oct 


942 


1010 


forward 3 


TM 

• * w 1 


65 


348775 1 oct 


989 


1048 


forward 2 


TM 

1 P V 1 


65 


348775 1 oct 


235 


282 


forward 1 


TM 


65 


348775 1 oct 


967 


1017 


forward 1 


TM 


65 


348775. 1 .oct 


907 


975 


forward 1 


TM 


66 


336239 5 dec 


1670 


1744 


forward 2 


SP 


66 


336239 5 dec 


1317 


1379 


forward 3 


TM 


66 


336239.5.dec 


2417 


2485 


forward 2 


SP 


66 


336239.5.dec 


1217 


1279 


forward 2 


TM 


66 


336239.5.dec 


2217 


2282 


forward 3 


SP 


66 


336239.5.dec 


1725 


1784 


forward 3 


TM 


66 


336239.5.dec 


2211 


2273 


forward 3 


TM 


66 


336239.5.dec 


852 


935 


forward 3 


TM 


66 


336239 5 dec 

VWV^.V' .V^.*»JWW 


2226 


2276 


forward 3 


TM 


68 


391940.2.dec 


2125 


2211 


forward 1 


SP 


69 


978302.3.dec 


2319 


2372 


forward 3 


TM 


70 


228629.11. dec 


917 


979 


forward 2 


TM 


70 


228629.11. dec 


944 


997 


forward 2 


TM 


70 


228629.11. dec 


917 


1009 


forward 2 


SP 


71 


011211.5.dec 


1515 


1580 


forward 3 


SP 


71 


011211.5.dec 


1515 


1586 


forward 3 


SP 


71 


011211.5.dec 


1515 


1598 


forward 3 


SP 


71 


011211.5.dec 


1515 


1577 


forward 3 


SP 
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WO 01/21836 



PCT7US00/25643 



Table 4 



SEQ 










SEQ 










ID 


Template ID 


Component 


Start 


Stop 


ID 


Template ID 


Compohent 


Start 


Stop 






in 






NO 




ID 






] 


fUJO IU. I ,uu 


A-i m ci ui 
41 UlOl rll 


i 


Ol 1 




*fUOO I U. I .OCl 


n01 1 *^1 RO 

y d. i i o i v>cL 


1 RRO 
I OOc 


P1 17 


1 


j.n^*^in 1 not 

**\J r j»D 1 U. 1 .OCl 


R7QOQ^ Q U1 1 

ooy^o4ori 1 


1 AO 


o/y 




40^*^10 1 AAt 

4UOO IU. I .OCl 


y^_o/ /ill 


1 RQR 

i oyo 


P1HQ 

I UZ7 




4UDO I U. I .OCT 


C/1 QCCAOUI1 

o4c5bbUorl 1 


OAO 


>l QA 
4o4 




V1ACQ1A 1 AAt 

4UOO IU. I .OCl 


4O/DD40n I 


1 «QQ 

i oyo 


POOR 
tUOO 


1 


^iacqia 1 
4UOo IU. 1 .OCt 


OjIQCI O-f LJ A 

<>4oblo I Ml 


ooy 






ylACTIA 1 r>r+* 

4UO01U. I -OCl 


onftl A/I 1-11 

b^ib I U4ri l 


1 OAO 

i yuo 


Ol 4ft 
tL I HO 




viacoia -1 rtj-i* 
4UOOIU.1 .OCT 


5iy40^3Ml 


404 


obO 




4UOO IU.1 .OCl 


I ooo^yori i 


1 QQft 

i yoo 


OIOP 

tzL 1 U^. 


] 


4U001U. I .oct 


Ci Q/|i 1 Oil ^ 

ol o41 ion 1 


/Ail 

404 


b4y 




ilA*^0.1A 1 r\r*>t 
4UOO I U. I .OCl 


H^ 1 OOHOrl 1 


1QO.O 

i yoy 




1 


4UD01U.1 .OCt 


a r~ f\ a r% o rr I M 

4b04obbM1 


oyy 


CCA 

boU 


] 


4U0O1U. 1 .OCl 


>I7n07R1 l_J 1 


1 Q7C 

i y /o 


OOOl 


1 


4053 1 0. l .OCt 


5o77iobM1 


1b00 


ibo4 


] 


40oo10.1 .OCt 


ooyooyOri 1 


1 QOQ 

lyoo 


OOOC 


1 


405310.1 .Oct 


g2525894 


4 coo 

1 5oo 


"t CCA 

1 bo4 




/I A CI HA 1 

4UoolO. I .OCt 


o<i/^UobMl 


1 QQC 

i yyo 


OOvlQ 


1 


405310.1 .OCt 


y« ■« OOCOQU1 

41*£ob*forn 


iboo 


■I COI 

i bol 




4UOO IU. I -OCl 


OO^iOOoori t 


oaoi 


Ol AO 
^ I Kjc. 


1 


405310.1 .OCt 




-1 Mil 

1 bo4 


lbb4 


* 


/IACQ1A 1 s\r*i 

4UOOIU. 1 .OCl 


Q1 1 Oflfi'JHI 

o i i ^obori i 


OAOO 


OOOO 


1 


405310.1 .OCt 


■4 OC7QOOLH 

i ob/ oton i 


•1 CA1 

Ibuo 


1bo4 


] 


A ACQ 1 A -1 /\ a4 

4Uoo 1 0 . 1 .OCt 


-I C7 AC7/I U1 1 

ib/UO/ 4n 1 






1 


405310. 1 .OCt 


lob/oy in! 


IbOo 


■1 CCA 
ID04 




/I ACQ A A ~i y\*-»t 

4Ubo IU. 1 .OCt 


ID/UDoorD 




^DOO 


] 


405310. 1 .OCt 


4uu/ oi ym 


1 R1 
1 0 I O 


1 fZQA 
I Do4 




/1ACQ1A 1 nrt 
'•■UOO IU. I .OCl 


I D / UDOOn 1 


OAOO 


PO,14 

ilO 1 H 


1 


40531 0. 1 .OCt 


gi i yo/oo 


1 b04 


01 AD 


1 


4ACQ1A 1 r\r<i 
WOO I U. I .OCl 


O l OOOHHn l 


^.1^1 


PORP 


1 


40531 0. 1 .OCt 


g4u/oU4<£ 


1 R*^7 

i bo/ 


01 AO 




/1ACQ1A 1 
4UOO I U. I -OCl 


74nQQOM1 




P417 
t4 1 / 




405310.1 .OCt 


b/U4^4Uril 


1 CCO 


i bo4 




./IACQ1A 1 nrf 
4UOOIU. I -OCl 


'X'XOOACl A l-l 1 


Ol 7c. 
^. I / O 


t4O0 




4053 lO.l .OCt 


i yuo i i i i b 


1 R7fl 
1D/U 


OAC/ 

^UD*!- 




4AC01 A 1 not 
4UOO I U. 1 .OCl 


^Houo/un i 


C-CJO I 


P4RA 
^HOU 




405310. 1 .OCt 


ii4o<iuU4 1 b 


1 Pi71 
ID/ I 






Afl^ O 1 not 
*KJOO IU. I .OCl 


O 1 OO 1 57 1 PI 1 


PO.OA 


cODO 




4UOO l U.l .OCt 


o/ ^iuyu4ri i 


1 ft71 
ID/ 1 


I OO I 




4AC**in 1 not 


7RQ4fipUi 




P^^R 

t-OJO 


J 


A AC01 r\ 1 rv^kf 

4053 I O.l .OCt 


i yuo I I i rb 


1 A77 
ID/ / 


01 AO 




Ancain 1 not 

4UDO I U. I .OCl 


ri 41 om 

0 1 0 1 h 1 tn 1 


0071 




] 


/iacqia 1 not 

4UD01 U. 1 .OCl 


i yuo 1 1 In 1 


1R77 
lb// 


1 QAA 




40^9.10. 1 not 

4UOO 1 \J. 1 .OCl 


1QP1 CTOUII 
1 10 / on 1 




PRR°. 




^n^Q1A 1 not 
hUOO I U- I .OCl 


f I DUOun 1 




1 DO*+ 


1 


40^10 1 not 


n1 S170^4 
y i 0 1 1 uoh 




P74Q 




4ncom t not 

hUDo i U. 1 .OCl 


yoooo/ 5? 


1 o / / 


onivt 




40^^10 1 Art 

HUJJ 1 v/. 1 -*JOl 


°P7P7 £ >flH1 


?470 

tH / U 


P701 

b ( u 1 


] 


^n^in 1 not 
HUOO I U. I .OCl 


jtOo/Don f 


I O/O 






40^,9.10 i nr t 


yooot 1 0 




PftSfi 

tOOU 




4UOO IU. I .OCl 




1 A7Q 
id / y 


i yuo 


. 


4rm^io 1 Aot 

HUOO t U. 1 -OCl 
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1 .6.oct 


g21 11559 


142 


587 


12 


272721 


1 .6.oct 


4243666H1 


2168 


2424 


12 


27272' 


1 .6.oct 


4558830H1 


144 


270 


12 


27272^ 


\ .6.oct 


1 784361 H1 


2168 


2452 


12 


27272* 


1 .6.oct 


4739703H1 


170 


396 


12 


272721 


1.6. oct 


689468H1 


2173 


2423 


12 


27272 


1 .6.oct 


3349536H1 


176 


446 


12 


27272 1 


1 .6.oct 


5618378H1 


2175 


2375 


12 


27272 


1 .6.oct 


28831 39F6 


226 


616 


12 


27272' 


1 .6.oct 


2707953H1 


2181 


2481 


12 


27272 


1 .6.oct 


1818319H1 


317 


570 


12 


27272- 


1 .6.oct 


4182894H1 


2180 


2428 


12 


27272 


1 .6.oct 


5500902H1 


426 


567 


12 


27272- 


1 .6.oct 


1808379H1 


2191 


2400 


12 


27272 


1 .6.oct 


5500602H1 


427 


661 


12 


27272* 


1 .6.oct 


3648072H1 


2192 


2491 


12 


27272 


1 .6.oct 


2383864F6 


427 


876 


12 


27272* 


1 .6.oct 


g2069859 


2190 


2624 
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12 


272721 6 oct 

£— § ft— * 4b. | . V** »wV» 


3651272H1 


2193 


2488 


12 


272721 6 oct 

ft— 9 ft— f C— 1 • W*WV/» 


351 1433H1 


1596 


1868 


12 

■ ft 


272721 6 oct 


544858H1 


2193 


2427 


12 


272721 6 oct 

ft— # ft— f ft— ■ ■ b' ■ %> 


3443682H1 

b^ ■ i w wwbi ■ ■ 


1633 

■ www 


1900 

i *w* vy \-J 


12 


272721 6 oct 

*— # ft— f tb. 1 - Vy » WW ■> 


359974H1 


2193 


2423 


12 


272721 6 oct 

ft— V ft— 9 ft— ■ » « *. 


355481 7H1 


1634 


1917 


12 


272721 6 OCt 


806833H1 


2197 


2447 

ft— f t # 


12 


272721 6 oct 

b « b» f b I ■ * b^\^fc 


4795906H1 


1635 


1856 


12 
i ft 


272721 6 oct 

ft » 4b # ft • .W-VW* 


1800635H1 

■ ^4^H^ ^^X^ W^K/ 1 1 | 


2198 

4b 1 W<J 


2387 

w U f 


12 


272721 6 oct 

(.1 <• 1 . W, WW I 


1 68961 9F6 


1 640 


1958 

1 w w W 


12 


272721 6 OCt 


4126223H1 


21QR 

t— \ wO 


227Q 


12 

1 4b_ 


272721 6 oct 

C~ 1 C » C- ■ - w . w w I 


1 68961 9H1 


1 640 


1844 


12 

1 b- 


272721 6 oct 


409008H1 


21QQ 


237Q 

t_ W / W 


12 


272721 6 oct 

<_ / t— 1 £— 1 - w , WW L 


5425548H1 


1643 

1 w" w 


1748 

i 9 by 


12 


272721 6 oct 


463051 2H1 


220R 


24RQ 

b.*T\_/W 


12 


272721 6 oct 

ft l ft # 4b 1 .U.UUl 


3238705H1 


1644 


1898 

1 www 


12 

1 ft 


272721 fi oct 

ft / ft / ft 1 .O-wwl 


4121202H1 
"it. i even I 






12 


272721 R ort 

ft * ft / ft. 1 .D.%Jt»l 


37fifiQfi7H1 


Ifi^ft 


19^1 


1? 
■ ft 


272721 6 ort 


59530ftQHl 


1 Qftfi 

1 C700 


OOPiQ 


12 


272721 fi ort 

ft / ft / ft 1 .O.UOI 


304QS24H1 


1fifi3 


1QS4 


12 


272721 fi ort 

ft / ft / ft 1 -U.Uu I 


5952Q2QH1 


1 QAfi 


P9QR 

C4C?Q 


12 


272721 R ort 

ft / ft / ft 1 .u.UUl 


3SRR7^ftH1 
ojuu / uo n i 


1RRR 


1QR8 


12 


272721 fi ort 


4507^001-11 


1 J?wO 


P2R2 


1 2 
I b 


P72721 R ort 

ft/ ft/ ft 1 .O.Uwl 




1R72 

1 \J * ft 


1Q17 

1 s7 1 / 


12 
1 ft 


272721 fi ort 


4^01^14H1 






12 


2797P1 R ort 

ft/ ft / ft 1 .U.UUl 


at R04^^H1 


1fi7fi 


1QRR 


12 


272721 fi ort 




2004 


22Q4 


12 


272721 fi ort 

ft / ft / ft. 1 -O.Uwl 


i^pRSIRRHI 

sJftv/vJ 1 Jjlt 1 


1fiQ0 


1Q44 

1 w*+*-r 


12 


272721 6 ort 


Q61 00SR2 


2004 

fc- W W*T 


25S3 

£ J JO 


12 


272721 fi ort 

ft * ft / ft I • vJ .ww I 


2073037 HI 

ftw / OUO / 11 1 


1694 

1 Ww*"T 


1967 

1 J w / 


12 


272721 fi ort 

ft / ft / ^1 1 .U.UU1 


n221ft^72 


200ft 


217R 

4b 1 / O 


12 
i ^ 


272721 R ort 

ft / ft / ft 1 .U.UUL 




1RQR 

1 WW w 


1Q4Q 

1 w*+ w 


1P 
i ft 


272721 fi ort 
ft / ft / ft ■ .o.uoi 


1R^ft72fiH1 


201^ 


221 ^ 

4b^ 1 O 


12 
i ib 


272721 fi oct 

ft f ft / ft 1 -U.UUl 


27Qfi72^H1 
ft / jjo / fto>ri i 


1RQfi 




12 
i ft 


272721 R ort 


4.^7^1 07H1 


201 3 

£. w 1 O 


22Q3 


12 


272721 fi ort 

ft / ft / ft 1 ■ w .UUl 


421 14R1 H1 

*+ft 1 1 HO 1 11 1 


2341 




12 
i ft 


272721 fi ort 

ft r ft / ft 1 . w>.Uwl 


16^87*57141 


2013 

l_v 1 w 


2220 


12 

1 4b 


272721 6 oct 

ft » ft # 4b 1 . W .UOl 


4547403H1 


2344 


2591 

ft WW 1 


12 


272721 fi ort 

ft / ft S ft 1 .U.Uvl 


4168120H1 

*t 1 \J%J I Cwl 1 1 


2018 

4Lv 1 w 


2294 

4— w" 


12 


272721 6 oct 

ft» ft / ft I - w . WW L 


o 1626091 

y 1 w4— w w w I 


2345 

ft \J^\J 


2556 

b_ w wU 


12 

1 ft 


272721 6 oct 


5152124H1 


2018 


2289 

<■ i «T n Wl br 


12 

1 b 


272721 6 oct 

4b* f ft— f 4b» 1 .W ( V/Wl 


290356T6 

b\/WWwW 1 


2354 

b_ w 


2922 


12 
• <— 


272721 fi oct 

ft/ ft/ ft 1 .U.UWL 


157823H1 


2018 

^- w I w 


2201 

4T 4T W 1 


12 

I C— 


272721 6 oct 

ft— / b_ # 4b— f • W • lr 


a 1996936 

y 1 W W W W W W* 


2358 

^— w w W 


2621 

£- w£- 1 


12 

1 ft— 


272721 6 oct 

ft * ft f ft 1 . U.V^wl 


1 57823 R1 

1 9 fjft— w lit 


2018 


2293 


12 


272721 6 OCt 

4b> / b 1 b I 


387322 H1 

b^b/ f b^4b_4b— 1 I I 


2362 

b WWb 


2642 

■— w"b 


12 


272721 6 oct 


35641 17H1 


2019 


2319 


12 


272721 6 oct 

ft # * ■- f ft-,- ■ * V»» - ftj 


3532652H1 


2368 


2616 


12 

1 b_ 


272721 6 oct 

kl bf cb> 1 • Vy » WW it 


206466F1 


2020 

«— w ft— v 


2639 


12 

i ft— 


272721 6 OCt 

b # b ■ ft— ■ - b*' • » 


578701 3H1 


2370 


2644 


12 

1 ft- 


272721 6 oct 

ft » ft » ft 1 .U.UOl 


464621 5H1 


1902 


2174 


12 


272721 6 oct 

b 1 ftb> f ib> I *X^>b^%^lj 


5784572 H1 


2370 

ft— w / vy 


2676 

4W vy / vy 


12 


272721 6 oct 

b- f b_ r Li I ■ \*J * vy w 


4646289H1 

T V^ft_ • ■ V 


1902 


2072 


12 


272721 6 OCt 

b 9 b» # bto ■ - b* » b* * 


5789842 H1 


2370 


2660 


12 


272721 6 oct 


4855045H1 


1902 


2096 


12 


272721 .6,oct 

ft— 4P ■ * 41 V * Ng*' * X<# % 


5794037H1 


2370 


2671 


12 


272721 6 oct 

« — ' *— . f b— f iVyiVyWft. 


2108791 H1 


1903 


2076 


12 


272721 6 oct 

ft— » ft— * 4* ■ ■X^»^^^^fc 


5788592 H1 

W 4T b* W S/ b_ 1 V 1 


2370 


2671 


12 


272721 6 oct 

l. # C # 4b. 1 ■ Vy ■ VSw 1 


2584487H1 

b\iU~^W fill 


1903 


2070 


12 

■ ftw 


272721 6 oct 

*— * bf b- ■ * \-4^ -b/b4> 


5787092 H1 


2370 

b. b/ # \# 


2645 

b— vy ^Txy 


12 


272721 6 oct 


O734039 


1903 

1 W w V** 


2248 

ft— ft— ft— # 


12 


272721 6 oct 


3408832 H1 


2374 


2613 

bw i by 


12 
■ ft 


272721 6 oct 

£- ' ft 9 ft \ ,\J ,\J\*\. 


2822283H1 


1904 

1 w w ▼ 


2106 

ft— i 


12 

1 4b 


272721 6 oct 

b- ' b_ / bV 1 ♦'-'♦^/Wl 


2932187H1 

bvvb 1 v/ 111! 


2378 

b. w r by 


2633 

ftw WW W 


12 


272721 6 oct 

ft 1 ft / C— 1 . U.uVjl 


4508977 H1 


1906 

1 www 


2154 

4b_ i b^^r 


12 

i ft— 


272721 6 oct 

b # b- / 4b. 1 tU.Uwi 


5465681 H1 


2381 

bww 1 


2547 


12 


272721 R oct 


1534302H1 


1913 

1 w 1 »J 


2124 

4b 1 4b*T 


12 

1 4b 


272721 6 oct 

4b- 9 b_ / 4b. 1 lV»VVi 


a 1979728 
y • w » w / ft 


2388 

ft www 


2621 


12 


272721 fi oct 


1531421H1 

1 V«J I I 1 I 1 


1913 

I w 1 w 


21 10 

C— I 1 w 


12 
i i— 


272721 6 oct 


3500620H1 

W W W W W W 1 1 I 


2389 

C— w w^ w 


2671 


12 


272721 6 oct 


153161 1H1 

1 ww 1 V 1 1 1 1 1 


1913 

1 w ■ w 


21 17 


12 

■ *b 


272721 6 oct 

b * ftb> # b. ■ aVJ^V/Wl 


1979478T6 


2392 


2892 


12 


272721 fi oct 


3560701 H1 

OOUU /will I 


1914 

1 ^ 1 *T 


2240 

4b4b*-rW 


12 

1 4b 


272721 6 oct 

4b f 4b » ft 1 «V.W\>1 


6373060H1 

WW # W W W W 1 1 t 


2398 


2647 

ft W*T / 


12 


272721 fi oct 


n1Q57940 

V_J 1 w w f 3*tU 


1Q20 


2295 

4T. <L w w 


12 


272721 6 oct 

ft/ ft/ ft ■ - w - w w l 


206466 R1 

4.UUtUWI 1 I 


2399 


2941 

l— w ^ 1 


12 


272721 fi ort 


4^7027*^H1 
*t*j/ wb / on i 


1Q24 


21 50 

4b 1 JV 


12 


272721 6 ort 

ft / ft / ft I - \J -WW V 


17093R7H1 

1 / W W WW / III 


2408 

ft*+V/0 


2R23 

ftUftO 


12 


272721 fi ort 


4370270H1 

*T«J / \J£— 1 w 1 1 1 


1924 

1 wt."-r 


2192 

£. 1 w C— 


12 


970701 6 oct 

C— 9 C— f 1 • w . W w I 


51 37771 H1 

W \ \J 1 1 0 llll 


2416 


2675 

ft U / w 


12 


272721 6 oct 


WW w"w 1 Ul 1 1 


1932 


2195 

1 w w 


12 


272721 6 oct 

ft / ft / ft 1 .U.UL»l 


1906004H1 

1 C7 W w V W " 1 1 1 


2419 

ft*T 1 W 


2675 

ftU / w 


12 


272721 fi ort 




1Q32 


2194 


12 
i ft 


27272 '1 6 oct 

ft / ft / ft 1 .U.Ulrfl 


2419Q56H1 

ft T 1 JCJUI 1 1 


2419 

ft"T 1 W 


2659 


12 
i ft 


272721 fi ort 


4742f)17H1 


1Q34 


2070 


12 

1 £. 


272721 fi ort 

ft / ft r ft 1 . w .ww I 


470Q480H1 


R31 


7R5 


12 
1 ft 


272721 fi oct 


2946355H1 

£- W" W WW*w* 1 1 1 


1935 

1 www 


2254 


12 


272721 6 oct 

b. ' b. # b> 1 » w 1 . V v I 


4759041 H1 


634 

WW r 


856 

y*j ww 


12 


272721 6 oct 


3341 847 H1 

w W^T | fill 


1935 

1 w w 1 w 


2166 

1 Ww 


12 

1 4b 


272721 6 oct 

ft/ftr 4b 1 - w - ww i 


2212119H1 

4b4b 1 ft 1 1 W 1 II 


644 


890 

www 


12 


6 oct 


3629088H1 


1935 

1 w vJ*w 


2211 


12 


272721 6 oct 

t_ / t. / t. | r W » W W V 


48751 52H1 


653 

Www* 


838 


12 
■ ft 


272721 6 oct 

4b / * 4b 1 •V^»V/wli 


857934H1 

WW / WV^*T* 1 1 


1937 

■ w w / 


2186 

t_ 1 w w 


12 

1 4b 


272721 6 oct 

4b. ■ ft— / b- 1 - V • w Vi 


1794525H1 


675 

W # b^ 


943 

w^w 


12 

l 4t_ 


272721 6 oct 


4218690H1 

~b 1 UUUUI I 1 


1940 


2043 


12 


272721 6 oct 

b. i ft — J b. ■ i 


1 697241 H1 

1 VJw I *— * llll 


685 

Ww 


928 

v 4b_vy 


12 


272721 6 oct 

4U / 4W * ft* 1 - \— 4* - V/ W I 


a1 664571 


1940 


2356 

bW WW 


12 

■ b— 


272721 6 oct 

b— r b— # b— ■ • *— ' * b* * 


3556564H1 


689 


972 

w ff b 


12 


272721 6 oct 


a1 792741 


1940 


2291 


12 


272721 6 oct 

9^^ 9 9j — ■ ■ ■ b*F ■ ^#»«4r ■> 


4444063H1 


731 


963 


12 

1 4b 


272721 6 oct 


2255504H1 


1944 


2178 


12 

I ft— 


272721 6 oct 

b» f b 9 b 1 


4205033 H1 

~b»V W Vw W 1 1 I 


737 

• by r 


862 

vy vy b. 


12 


272721 6 oct 


2180294H1 


1955 


2076 


12 


272721 6 oct 

ft— * b I ft— 41 « - t- 


3170922H1 


741 


1007 


12 


272721 6 oct 


444281 OH 1 

W I T 4b_ ■ ftj4» ■!! 


1956 


2233 


12 


272721 6 oct 

b- f b* * ft— * •W«MVft 


2540429 H1 

*Wb^ » b» " *— b* ■ ■ ■ 


771 


1004 


12 


272721 6 oct 

ft— r f ft— 1 • ^ .\-f\-* \. 


a 1745686 


1968 


2294 

ft— ft— - w 


12 

i ft— 


272721 6. oct 

ft— W ft— * 1 . \y 


3209223H1 

VbVvbi by 1 1 1 


781 

# by i 


1065 

i vyvy w 


12 


272721 6 oct 

4b # 4b f 4b i .U.Uvl 


5069083H1 


1973 

1 w # w? 


2246 


12 


272721 6 oct 


477951 2H1 

9 9 \J \J % ft— III 


803 

w W w 


948 

w ^ w 


12 
1 ft 


272721 6 oct 


460501 6H1 


1973 

1 w / w 


2228 


12 


272721 6 oct 

4b/ £— f £. 1 » w . WW^ 


5038225H1 

w w w W f r w 1 i i 


810 

w 1 w 


884 


12 

1 ft 


272721 6 oct 

4S» # 4b 1 4b I • V» Wwh 


949436H1 


1973 


2190 

b I W W 


12 

1 4b> 


272721 6 Oct 

ftb. / b. 9 4b. 1 • \J >Uwl 


4760088H1 


810 

W 1 w 


881 


12 


27272 1.6-OCt 


948233H1 


1973 


2221 


12 


272721 .6oct 


4794851 H1 


831 


1067 


12 


27272 1.6.oct 


1798444H1 


1979 


2252 


12 


272721 .6.0Ct 


g1 961401 


872 


1287 


12 


272721 .6.0Ct 


176025H1 


1981 


2261 


12 


272721 .6.oct 


993703H1 


881 


1101 


12 


27272 1.6.oct 


2203232H1 


1983 


2222 


12 


272721 .6.oct 


38021 42H1 


890 


1155 


12 


272721. 6.oct 


g 1670678 


1579 


1968 


12 


272721 .6.oct 


5436252H1 


892 


1092 


12 


27272 1.6.0Ct 


90901 0H1 


1578 


1866 


12 


272721 .6.oct 


g1 984201 


898 


1344 


12 


27272 1.6.0Ct 


5390293H1 


1578 


1810 


12 


272721 .6.oct 


657267H1 


944 


1186 


12 


272721. 6.oct 


5038305H1 


1582 


1849 


12 


272721 .6.oct 


2531018H1 


956 


1178 


12 


27272 1.6.oct 


397501 7H1 


1584 


1861 


12 


272721 .6.0Ct 


3686443H1 


969 


1257 


12 


27272 1.6.0Ct 


5681103H1 


1593 


1851 


12 


272721 .6.oct 


2821084H1 


970 


1263 



196 



WO 01/21836 



PCT/USOO/25643 



Table 4 



12 


272721 


.6.oct 


5391261 H1 


990 


1130 


12 


272721 


.6.oct 


g4085144 


2492 


2940 


12 


272721 


.6.oct 


860948H1 


992 


1237 


12. 


272721 


.6.oct 


g3245239 


2491 


2940 


12 


272721 


,6.oct 


4602249H1 


996 


1250 


12 


272721 


.6. oct 


1629148T6 


2500 


2899 


12 


272721 


6.oct 


2258436H1 


998 


1215 


12 


272721 


.6.oct 


3375874H1 


2500 


2762 


12 


272721 


6.oct 


g1 9791 22 


1002 


1404 


12 


272721 


,6.oct 


g4 107892 


2506 


2956 


12 


272721 


.6.oct 


g2030330 


1006 


1373 


12 


272721 


.6.oct 


g3796949 


2507 


2930 


12 


272721 


.6.oct 


3049484H1 


1016 


1296 


12 


272721 


.6.oct 


g2342174 


2517 


2936 


12 


272721 


6.oct 


5422806H1 


1015 


1277 


12 


272721 


.6.oct 


3705402H1 


2518 


2744 


12 


272721 


.6.oct 


487571 2H1 


1017 


1283 


12 


272721 


.6.oct 


3665055H1 


2522 


2818 


12 


272721 


.6.oct 


2930958H1 


1038 


1321 


12 


272721 


.6.oct 


g5 18303 


2516 


2946 


12 


272721 


.6.oct 


939670H1 


1040 


1283 


12 


272721 


,6.oct 


g 1664375 


2524 


2945 


12 


272721 


.6.oct 


3785755H1 


1047 


1231 


12 


272721 


.6.oct 


g4531573 


2531 


2946 


12 


272721 


.6.oct 


3552260H1 


1064 


1369 


12 


272721 


.6.oct 


415664H1 


2535 


2761 


12 


272721 


,6.oct 


1443934H1 


1069 


1329 


12 


272721 


.6.oct 


414600H1 


2535 


2748 


12 


272721 


.6.oct 


2119473H1 


1072 


1301 


12 


272721 


.6.oct 


g2716083 


2536 


2948 


12 


272721 


.6.oct 


3359947H1 


1086 


1331 


12 


272721 


.6 .oct 


5426832H1 


2534 


2810 


12 


272721 


.6.oct 


5205141 H2 


1087 


1238 


12 


272721 


.6.oct 


3713386H1 


2545 


2838 


12 


272721 


.6.oct 


3725856H1 


1127 


1436 


12 


272721 


.6.oct 


g1 577334 


2546 


2887 


12 


272721 


,6.oct 


4543655H1 


1134 


1376 


12 


272721 


.6.oct 


5809476H1 


2551 


2823 


12 


272721 


.6.oct 


881722R1 


1134 


1711 


12 


272721 


.6.oct 


1680089H1 


2552 


2768 


12 


272721 


,6.oct 


881722H1 


1134 


1256 


12 


272721 


.6.oct 


g4536024 


2556 


2944 


12 


272721 


.6.oct 


568864H1 


1149 


1408 


12 


272721 


.6.oct 


2321 88H1 


2561 


2894 


12 


272721 


.6.oct 


27341 31 H1 


1149 


1405 


12 


272721 


.6.oct 


231662H1 


2563 


2743 


12 


272721 


.6.oct 


5445609H1 


1166 


1365 


12 


272721 


.6.oct 


232222H1 


2563 


2731 


12 


272721 


,6.oct 


4894901 H1 


1176 


1406 


12 


272721 


.6-Oct 


g2779419 


2563 


2930 


12 


272721 


.6.oct 


g1 8601 92 


2578 


2937 


12 


272721 


.6.oct 


28831 39T6 


2566 


2902 


12 


272721 


.6.oct 


1 94061 5T6 


2583 


2902 


12 


272721 


.6.oct 


g4452467 


2567 


2949 


12 


272721 


.6.oct 


5068437H1 


2579 


2852 


12 


272721 


.6. oct 


g3405917 


2576 


2949 


12 


272721 


.6.oct 


1 94061 5R6 


2583 


2939 


12 


272721 


.6.oct 


197888H1 


1697 


1915 


12 


272721 


-6.oct 


1 94061 5H1 


2583 


2822 


12 


272721 


,6.oct 


5072033H1 


1703 


1970 


12 


272721 


.6.oct 


g4222524 


2588 


2936 


12 


272721 


,6.oct 


2613968H1 


1703 


1939 


12 


272721 


.6.oct 


4950868H1 


2589 


2841 


12 


272721 


.6.oct 


962056R1 


1703 


2252 


12 


272721 


.6.oct 


g41 10048 


2590 


2947 


12 


272721 


.6.oct 


962056H1 


1703 


1993 


12 


272721 


.6.oct 


600791 H1 


2591 


2836 


12 


272721 


.6.oct 


6095332H1 


1714 


2026 


12 


272721 


.6.oct 


4981557H1 


2595 


2850 


12 


272721 


1 .6.oct 


2244263H1 


1714 


1940 


12 


272721 


_6.oct 


g4452122 


2596 


2939 


12 


272721 


1 .6.oct 


1865981 H1 


1732 


1991 


12 


272721 


.6.oct 


g1 114427 


2602 


2930 


12 


272721 


.6.oct 


5659364H1 


1736 


1964 


12 


272721 


.6.oct 


g4451191 


2599 


2940 


12 


272721 


.6.oct 


4299096H1 


1737 


1929 


12 


272721 


I.6.oct 


g892675 


2605 


2957 


12 


272721 


.6.oct 


g1 521980 


1754 


2096 


12 


272721 


L6.oct 


g3424444 


2604 


2945 


12 


272721 


1 .6.oct 


5665880H1 


1759 


2008 


12 


272721 


1 .6.oct 


g2111560 


2604 


2950 


12 


272721 


1 .6.oct 


3926250H1 


1767 


2045 


12 


272721 


1 .6.oct 


g1 9821 75 


2608 


2946 


12 


272721 


1 .6.oct 


3846571 H1 


1783 


2077 


12 


272721 


1 .6.oct 


g3919967 


2606 


2942 


12 


272721 


t .6.oct 


1752970H1 


1791 


2049 


12 


272721 


1 .6.oct 


g4391891 


2606 


2941 


12 


272721 


1 .6. oct 


2421660H1 


1791 


2025 


12 


272721 


1 .6.oct 


3624992H1 


2608 


2896 


12 


272721 


1 .6.oct 


1751279H1 


1791 


2021 


12 


272721 


1 .6,oct 


g3918885 


2603 


2938 


12 


272721 


1 ,6.oct 


1 29451 9H1 


1795 


1991 


12 


27272* 


1 .6.oct 


4635746H1 


2612 


2867 


12 


272721 


1 .6.oct 


5286768H1 


1795 


1984 


12 


27272 * 


1 ,6.oct 


885047T1 


2613 


2896 


12 


272721 


1 .6.oct 


2505278H1 


1795 


2030 


12 


27272* 


L6.oct 


885047H1 


2613 


2919 


12 


272721 


1 .6.oct 


1851982H1 


1802 


2081 


12 


27272* 


L6.oct 


2397225H1 


2613 


2842 


12 


272721 


1 .6.oct 


5658144H1 


1807 


2014 


12 


27272* 


1 .6.oct 


g2397616 


2619 


2934 


12 


272721 


1 .6. oct 


577991 4H1 


1805 


2062 


12 


27272' 


1 .6.oct 


g2212530 


2621 


2961 


12 


27272^ 


1 ,6.oct 


5531864H1 


1828 


2086 


12 


27272* 


1 .6-OCt 


4362360H1 


2629 


2904 


12 


27272* 


1 .6.oct 


4742181 H1 


1836 


2097 


12 


27272 


1 .6.oct 


g1 5221 52 


2629 


2861 


12 


27272- 


1 .6.oct 


5217356H1 


1837 


2092 


12 


27272* 


1 .6.oct 


386546H1 


2631 


2917 


12 


27272* 


1 .6.oct 


5159607H1 


1839 


2100 


12 


27272* 


1 .6.oct 


g3117517 


2631 


2950 


12 


27272' 


1 .6.oct 


5619214H1 


1841 


2133 


12 


27272 


1 .6.oct 


g1 157968 


2632 


2941 


12 


27272* 


1 .6.oct 


1629148F6 


1841 


2270 


12 


27272* 


1 .6.oct 


344052H1 


2633 


2821 


12 


27272* 


1 .6.oct 


1629141H1 


1841 


2049 


12 


27272 


1 .6.oct 


2632475T6 


2634 


2896 


12 


27272* 


1 .6.oct 


5607027H1 


1847 


2059 


12 


27272 


1 .6.oct 


g41 14251 


2636 


2940 


12 


27272* 


1 .6.oct 


5139939H1 


1847 


2112 


12 


27272 


1 .6.oct 


g4 136680 


2635 


2949 


12 


27272" 


1 .6.oct 


5679332H1 


1850 


2128 


12 


27272 


1 6.oct 


g3017010 


2638 


2930 


12 


27272* 


1 .6.oct 


5592590H1 


1855 


2084 


12 


27272 


1 .6.oct 


g4284054 


2638 


2946 


12 


27272* 


1 .6.oct 


2918819H1 


1868 


2124 


12 


27272 


1 .6.oct 


389040H1 


2645 


2941 


12 


27272 


1 .6.oct 


.3256891 H1 


1869 


2123 


12 


27272 


t .6.oct 


g4328908 


2480 


2951 


12 


27272 


1 .6 .oct 


1563141H1 


1870 


2090 


12 


27272 


1 .6.oct 


2271363T6 


2484 


2901 


12 


27272 


1 .6.oct 


2652784H1 


1876 


2121 


12 


27272 


1 .6.oct 


1549356H1 


2486 


2610 


12 


27272 


! .6.oct 


4371853H1 


1876 


2152 
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12 


27272 1.6,OCt 


1399608H1 


1878 


2128 


12 


272721 .6-OCt 


6429277H1 


2266 


2655 


12 


27272 1.6-OCt 


541556H1 


1883 


2120 


12 


272721 .6. oct 


6098047H1 


2272 


2445 


12 


272721 .6.0Ct 


g2017461 


1891 


2176 


12 


272721 .6-OCt 


2308651 H1 


2272 


2524 


12 


272721 6 oct 

Wmm 9 Wmmm 9 ^— ■ 


4727656H1 


1898 


2168 


12 


272721 ,6.0Ct 


3014469H1 


2276 


2552 


12 


272721 6 OCt 


g3003890 


2872 


3204 


12 


272721 .6.oct 


786983R1 


2277 


2805 


12 


272721 6 oct 

t_ f Lm- 9 mmmw ■ • Km* * VjF 1 


564887H1 

mm* ^mW W *mm* ^gW * V 9 9 


2875 


3001 


12 


272721 6. oct 

Mmm ¥ Mmm 9 Mmm 9 w Km* * Km* Km* K 


786983 H1 


2277 


2537 


12 


272721 6 Oct 

L — f £ f * 1 • Km* , W *■ 


O2932091 


2875 


2996 


12 


272721 6. oct 

9m— 9 9mm ■ ■ ■ » Km* » Km* Km* K 


830695H1 


2277 


2537 


12 

1 tm~ 


272721 6 oct 


□500166 


2875 

9mm- Km* 9 Km* 


3008 

Km*Km9 \mj SmJ 


12 


272721 6 oct 

9mm. 9 9mm W tmm- 9 m Km* » Km* Km* ft 


1 858762 H1 


2286 


2549 


12 


272721 6 oct 


5525644 H2 

W W W I'll lb 


2908 

*—KS\J\m* 


3182 

w 1 w£~ 


12 

1 Mmm, 


272721 6 oct 

Mm— i 9mm 9 Ml—. I m W - Km* Km* L 


a 1026279 

KmM 9 Km* Ml %m**mm. M Km* 


2293 


2564 


12 

t i-m 


272721 6 oct 

km*. 9 £• i *— 1 - w - Km* Km* I 


1881380T6 

1 \mJ\mJ 9 VUV 1 \mJ 


2928 

W +mS 4mm W 


3385 

Km* KJ lw* KJ 


12 

1 Mmm 


272721 6 OCt 

mmm. 9 Wmmm 9 9-mm 9 • Km* • Km* Kmf % 


4303854H1 

■ Km* Km* Km* Km* Km* T9 ■ 9 


2302 


2567 


12 
• *- 


272721 6 oct 


4941268H1 

~w~T 1 CUUI 1 1 


2228 

r «■ ip w 


2490 


12 


272721 6 oct 

Mm— * Mmm- ' 9mm I ' Km* . W Km* \. 


1662959H1 

» Km* Km* Mmm. Km* Km* Km* ■ t 9 


2302 


2534 


12 


272721 6 Oct 


1 730067 H1 

1 » WW WW fill 


2236 

■ ■ WW 


2459 


12 
* *— 


272721 6 oct 

L. / Mmm 9 Mm— 9 » Km* * Km* Km* I 


a 1792740 

M | f w W a T Km* 


2306 


2458 


12 

1 Mm 


272721 6 oct 


1 29401 2H1 


2252 


2504 


12 


272721 6 oct 

ML— M Mm— * Mmm. 9 * Km* » K* Km* \ 


35961 26H1 

mm* *%m* Km* **** 9 Mamm KmW W 9 ■ 


2312 


2601 


12 


272721 6 oct 


1 293975F1 


2252 

£—£m- Km* C-m. 


2827 

9mm Km* 4mm 9 


12 


272721 6 oct 

Mmm 9 Mmm. 9 9mm, ■ ■ Km* m Km* Km* 9> 


1 68961 9T6 

9 Km* Km* Km* Km* 9 Kmf t \mW 


2307 


2903 


12 


97P721 6 oct 


1293975H1 

1 Mmm W W / W 1 I 1 


2252 

Km/ Mmm 


2497 


12 

1 Mmm 


272721 6 oct 

9mm 9 9mm. * Mm— 9 m Km* • Km* K* % 


1820290H1 

■ *Mmm* Mmmm Km* MmMmKM* Km* 9 9 I 


2317 


2572 


12 


272721 6 oct 


1 272847 H1 

1 ■ 9L \mmW w rial 


2252 


2500 


12 


272721 6 oct 

ll ■ ■ W Mmm. U ■ Km* • Km* Km* % 


1820281 H1 


2317 


2573 


12 


272721 6 OCt 

b 1 t— 1 b I . W . WW I 


29571 28H1 

£— W W / I bVJI ■ * 


2252 

C-C- Km* Mmm 


2545 

SmmKmJ^V KJ 


12 


272721 6 oct 

9mm 9 9mm* Mmm. 9 • Km* * Km* Km* K 


1873055H1 

■ Km** a Km* Kmf Km* Km* M 9 9 


2324 


2555 


12 


272721 6 oct 


243981 7H1 

mmm T "mm/ ^mmT 9 9 9 9 9 


2255 


2485 


12 


272721 .6.oct 


32981 63H1 


2325 


2582 


12 


272721 6 oct 

C / bi / b ■ - w .WW 1 


20401 37H1 


2255 


2519 


12 


272721 .6.0Ct 


805746T1 


2326 


2906 


12 

1 9mm 


272721 6 oct 


2264921 H1 


2257 


2464 


12 


272721 .6.oct 


5527791 H1 


2332 


2605 


12 


272721 6 oct 

b / C— 1 C— 1 - W m WW L 


2268728H1 

■ ■ \J\mf 9 9mm Km* 1 1 1 


2257 


2487 


12 


272721 .6. OCt 


3831404H1 


2336 


2550 


12 


272721 6 oct 


1690407H1 


2259 


2432 


12 


272721 .6.0Ct 


3840984H1 


2338 


2587 


12 


272721 6 oct 


763923H1 

9 \m9 Km* -mf IbV P ■ • 


2260 


2536 


12 


272721 .6.oct 


805746H1 


2338 


2573 


12 


272721 6 oct 


4894289H1 

■ ^mf W Mmm ^mm* Km* 1 * V 


1176 


1459 


12 


272721 .6.0Ct 


2201471 H1 


2337 


2578 


12 


272721 6 oct 


25561 83H1 


1181 


1433 


13 


461603.4.oct 


2667533H1 


1512 


1743 


12 


272721 6 oct 


3319651 H1 

^md* Kmf 9 Ky* Km^^m* 9 9 9 9 


1242 


1519 


13 


461603.4.0Ct 


1862102H1 


1523 


1792 


12 


272721 6 oct 


831550H1 


1250 


1452 


13 


461603.4.oct 


2355465H1 


1580 


1665 


12 


272721 ,6.oct 


3118134H1 


1275 


1558 


13 


461603.4.oct 


g3645529 


1645 


2081 


12 


272721 6 oct 


5902550H1 

Km* Km* KmrummKmW^m* Km* ■ ■ ■ 


1283 


1580 


13 


461603.4.oct 


3293621 H1 


1670 


1906 


12 


272721 6 oct 


4996858H1 

W ^mf ^mf Km* Km* K^ Km* ■ ■ ■ 


1283 


1563 


13 


461603.4.oct 


g2335995 


1736 


2089 


12 


272721 6 oct 


61 13602H1 

Kmr | w Kmr Kmr Km* Mm— ■ ■ R 


1295 


1617 


13 


461603.4.oct 


3752905H1 


1758 


2019 


12 


272721 6 oct 


4518784H1 


1297 


1551 


13 


461603.4.oct 


6094776H1 


1908 


2151 


12 


272721 6 oct 


1989226H1 

V ^/ Km* Km* MmmMmm Km 9 9 9 


1311 


1571 


13 


461603.4.oct 


41801 1H1 


1922 


2103 


12 


272721 6 oct 

9— 9 Cm, i ■ m\mM m.\ m 9\ m 0\ 


956338H1 

Km* Km* KM^K^KmfKmf I t 9 


1317 


1413 


13 


461 603.4 .oct 


417150H1 


1922 


2101 


12 


272721 6 oct 

C— 1 £— I mm, I - W -WWl 


4669770H1 

Twwv 9 9 Km* ■ I 1 


1322 


1560 


13 


461 603,4 .oct 


418870H1 


1922 


2118 


12 


272721 6 oct 


2821 261 H1 

9mm Km* Mmm 9 9mm Km* 9 t 9 9 


1324 


1630 


13 


461 603.4 .oct 


413059H1 


1922 


2129 


12 


272721 6 oct 

C f t_ f t_ 1 . W.wwl. 


1567553H1 

1 Km* Km* 9 Kmf *mm* \—M I I I 


1336 


1535 


13 


461 603.4 .oct 


413059R1 


1922 


2399 


12 


272721 6 oct 

C— 1 I mm, 1 . w . wwl 


4880720H1 


1352 


1590 


13 


461603 4 oct 

■ Km* 9 KmrKmrKmrm » • Km* Km* *■ 


418870R6 

* ■ KmW *mm* 9 Km* 9 %K*r 


1922 


2275 


12 


272721 6 oct 


3881948H1 

KmMKm* Kmf 1 W~V ■ 1 1 


1359 


1622 


13 


461 603.4 .oct 


417906H1 


1922 


2125 


12 


272721 6 oct 

C— i b f £— 1 - w . w^w I 


3156706H1 

Km/ | Km/ KJ 9 Km* Km/ 1 | 1 


1362 


1647 


13 


461 603.4 .oct 


696345H1 


1964 


2151 


12 


272721 6 oct 


50401 18H1 


1380 


1622 


13 


461 603.4 .oct 


5694066H1 


1987 


2177 


12 


272721 6 oct 


3788560H1 

* K^ KJ nmmf ^mm* Kmf 9 I 1 


1382 


1675 


13 


461 603.4 .oct 


2637796H1 


2019 


2288 


12 


272721 6 oct 


3806687H1 

wUvvvU rill 


1389 


1646 


13 


461603.4.oct 


3596965H1 


2058 


2223 


12 

1 Mm 


272721 6 oct 


1282584H1 


1425 


1685 


13 


461 603.4.oct 


2673521 H1 


2077 


2325 


12 


272721 6 oct 


1703878H1 


1441 


1664 


13 


461603.4.oct 

■ Km* m ~*mm m** a w • mmr -~mr m 


3405821 H1 


2085 


2349 


12 


272721 6 oct 


859573 R1 


1445 


2007 


13 


461603.4.oct 


482251 H1 


2100 


2335 


12 
• 


272721 6 oct 


859573H1 


1445 


1679 


13 


461603.4.oct 


484926R6 


2100 


2452 


12 


272721 6 oct 

t— 9 £— 9 9m— 1 » *mJ • \mf\^ \ 


4848606H2 

W ■ ^m ^m* 9 W Mmm 


1446 


1695 


13 


461603.4.oct 


5693609H1 


2114 


2272 


12 


272721 .6 oct 

9mm 9 9mm 9 9mm ■ * m ^m^Xm* % 


1558344H1 

9 KmrKmW^mWhmW ■ " ■ 9 9 


1446 


1638 


13 


461603.4.oct 


g1 188364 


2133 


2267 


12 


272721 6 oct 

*Mm 9 mm*. M Mmmt f m ^0 m ^m* m 


3720883H1 

V Mmmm Km* Km* K** Km* W 9 W 


1454 


1628 


13 


461603.4.oct 


3070461 H1 


2171 


2336 


12 


272721 6 oct 

* 9 Mmm 9 9mm- ■ m^mW mXJT^mt % 


3720892 H1 

Km* ' M, K*r ^m* ^* mmm I ■ ■ 


1454 


1739 


13 


461603.4.oct 


g2779413 


2210 


2611 


12 


272721 6 oct 

mmm M Mmm. W Mmm V m wKmTXm* % 


0261 43H1 


1456 


1796 


13 


461603.4.oct 


3356374T6 


2218 


2437 


12 


272721 6 oct 


5030608H1 


1457 


1723 


13 


461603.4.oct 


g1 780256 


2225 


2624 


12 


272721 .6.oct 


4674029H1 


1470 


1746 


13 


461603.4.oct 


1756956H1 


2404 


2651 


12 


272721 6 OCt 

*mm. t Mmm. M Mmm, ■ - m \mW ^. 


2634326H1 

Mmm Km* ^m* W Km* • ■ Km* ■ ■ 9 


1484 


1731 


13 


461603.4.oct 


1 25071 1F1 


2456 


2798 


12 


272721 6 oct 

* — # 9Lm. 9 9mm ■ • 


5830504H1 

Km* Km* \m* V Km* Km* ~T 9 W ■ 


1489 


1704 


13 


461 603.4 .oct 


5005176H1 


2456 


2684 


12 


272721 .6.oct 


5435568H1 


1494 


1715 


13 


461603.4.oct 


3001108F6 


2554 


3018 


12 


272721 .6.oct 


290356R6 


1494 


1974 


13 


461603.4.oct 


125241 1H1 


2570 


2798 


12 


272721 .6.0Ct 


3356346H1 


1507 


1735 


13 


461 603.4 .oct 


1951691H1 


2583 


2801 


12 


272721 6-Oct 


3342545H1 


1531 


1774 


13 


461603.4.oct 


1786802H1 


2625 


2725 


12 


272721 .6,oct 


4847556H1 


1552 


1806 


13 


461603.4.oct 


1 25071 1H1 


2668 


2798 


12 


272721 .6.oct 


g21 56699 


1574 


1992 


13 


461603.4.oct 


3001108H1 


2712 


3018 


12 


272721 .6.oct 


2503041 H1 


1574 


1797 


13 


461603.4.oct 


3154469H1 


2841 


3111 


12 


272721 6.oct 


4883822H2 


2265 


2537 


13 


461603.4.oct 


g4525001 


1 


365 


12 


272721 .6.0Ct 


1542559H1 


2265 


2478 


13 


461603.4.oct 


g4 148045 


2 


441 


12 


272721 .6.oct 


64301 13H1 


2266 


2655 


13 


461603.4.oct 


g3846311 


5 


228 


12 


272721 .6.oct 


4341009H1 


2265 


2586 


13 


461603.4.oct 


g41 36972 


5 


365 
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13 


461603.4.oct 


a 1005259 


33 


338 


14 


332465.2.dec 


5544928H1 


949 


1161 


13 


461603.4.oct 


2704364T6 


44 


404 - 


14 


332465.2.dec 


5615805R8 


1117 


1510 


13 


461603.4.oct 


2254891 T6 


44 


561 


14 


332465.2.dec 


4815011H1 


1211 


1483 


13 


461603 4 oct 

» XX • XX XX XX ■ 1 V X#* & 


26451 22T6 


46 


510 


14 


332465.2.dec 


g827150 


1285 


1543 


13 


461603 4 oct 


a2620015 


88 

xW 


297 


14 


332465.2.dec 


q312815 


1549 


2494 


13 

■ xv 


461 603 4 oct 

~XX 1 XX V-/ X<J • k. 


2013^16R6 


126 


582 


14 


332465 2 dec 


5283470H1 


1626 


1869 


13 
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12 


290 


35 


204932.4.dec 


2484267H1 


609 


740 


37 


444248.7.dec 


99471 1H1 


14 


255 


35 


204932. 4.dec 


3046336H1 


643 


916 


37 


444248.7.dec 


4380030H1 


15 


297 


35 


204932.4.dec 


3273541 H1 


651 


888 


37 


444248.7.dec 


5158504H1 


13 


283 


36 


400607.1 9.dec 


5372609H1 


1016 


1264 


37 


444248.7.dec 


366639H1 


13 


316 


36 


400607.1 9.dec 


5522590R6 


441 


917 


37 


444248.7.dec 


1695566H1 


14 


277 


36 


400607.1 9.dec 


6051163J1 


1075 


1600 


37 


444248.7.dec 


3046267H1 


16 


340 


36 


400607.1 9.dec 


044309H1 


1151 


1443 


37 


444248.7.dec 


369624H1 


13 


359 


36 


400607.1 9.dec 


3071036T6 


533 


675 


37 


444248.7.dec 


g610262 


14 


278 


36 


400607.1 9.dec 


645271 3H1 


797 


1087 


37 


444248.7.dec 


2180115H1 


15 


306 


36 


400607 1 9 dec 

r x^ x# X^ X» # • P x^ • 


265709H1 


837 


1144 


37 


444248.7.dec 


171713H1 


14 


224 


36 

W W 


400607 1 9 dec 


Q1198101 


1196 


1427 


37 


444248.7.dec 


369649H1 


15 


341 


36 


400607 1 9 dec 


6051 163H1 


864 


1366 


37 


444248.7.dec 


2180095H1 


15 


315 


36 


400607 19. dec 

» * ■ ■ w»v*ww 


g 1938960 


861 


1277 


37 


444248.7.dec 


775276H1 


15 


262 


36 


400607 1 9 dec 


5466843H1 


948 


1155 . 


37 


444248.7.dec 


1438177H1 


16 


311 


36 

WW 


400607 1 9.dec 


509021 6H1 


989 


1255 


37 


444248.7.dec 


2860455H1 


16 


296 


36 


400607 1 9 dec 


70391 2H1 


1216 


1446 


37 


444248.7.dec 


2896853H1 


17 


288 


36 


400607 1 9 dec 


a2879168 


1223 


1447 


37 


444248.7.dec 


369074H1 


17 


324 


36 


400607 1 9 dec 


1542073H1 


1333 


1553 


37 


444248.7.dec 


1434834H1 


17 


296 


36 

WW 


400607 19 dec 


5735728H1 


1345 


1622 


37 


444248.7.dec 


4643424H1 


16 


297 


36 


400607.19.dec 


4884751 H1 


1477 


1660 


37 


444248.7.dec 


2648482H1 


16 


293 


36 


400607.1 9.dec 


4970739H1 


25 


313 


37 


444248.7.dec 


1661048H1 


17 


293 


36 


400607. 19.dec 


4970746H1 


27 


318 


37 


444248.7.dec 


1821754H1 


16 


260 


36 


400607.1 9.dec 


3371686H1 


9 


250 


37 


444248.7.dec 


3118115H1 


18 


340 


36 


400607.1 9.dec 


g1 775491 


32 


121 


37 


444248.7.dec 


3340141 H1 


18 


296 


36 


400607.1 9.dec 


6540907H1 


59 


520 


37 


444248.7.dec 


2182036H1 


18 


302 


36 


400607.1 9.dec 


6115923H1 


58 


370 


37 


444248.7.dec 


2634076H1 


18 


304 


36 


400607. 19.dec 


5522590H1 


69 


323 


37 


444248.7.dec 


2897344H1 


19 


321 


36 


400607.1 9.dec 


4575104H1 


349 


622 


37 


444248.7.dec 


2897344F6 


19 


474 


36 


400607. 19.dec 


3455027H1 


1 


178 


37 


444248.7.dec 


1437855H1 


21 


296 


36 


400607. 19.dec 


562801 2H1 


9 


266 


37 


444248.7.dec 


1692892H1 


20 


263 


36 


400607. 19.dec 


593471 H1 


8 


253 


37 


444248.7.dec 


3339956H1 


21 


311 


36 


400607. 19.dec 


3071036H1 


9 


246 


37 


444248.7.dec 


2461339H1 


21 


292 


36 


400607.1 9.dec 


3071036F6 


9 


359 


37 


444248.7.dec 


3339085H1 


21 


298 


37 


444248.7.dec 


170389H1 


21 


240 


37 


444248.7.dec 


4400849H1 


21 


217 


37 


444248.7.dec 


1655929H1 


18 


208 


37 


444248.7.dec 


1820977H1 


21 


321 


37 


444248.7.dec 


1439566H1 


62 


315 


37 


444248.7.dec 


3979337H1 


22 


320 


37 


444248.7.dec 


2994742H1 


27 


293 


37 


444248.7.dec 


1558436H1 


26 


274 


37 


444248.7.dec 


3213073H1 


54 


335 


37 


444248.7.dec 


2897306H1 


28 


334 


37 


444248.7-dec 


28981 96H1 


17 


292 


37 


444248.7.dec 


768768H1 


31 


312 


37 


444248.7.dec 


1563083H1 


23 


228 


37 


444248.7.dec 


4577849H1 


31 


320 
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37 


444248.7.dec 


1820284H1 


31 


320 


40 


411396.24.dec 


483899T6 


1058 


1514 


37 


444248.7.dec 


24631 63T6 


49 


504 


40 


411396.24.dec 


4549096T1 


1060 


1500 


37 


444248.7.dec 


5433096H1 


89 


342 


40 


411396.24.dec 


1717178H1 


1089 


1336 


37 


444248.7.dec 


1692729H1 


156 


366 


40 


411396.24.dec 


g3645294 


1094 


1538 


37 


444248.7.dec 


4577774H1 


179 


274 


40 


411396.24.dec 


g4853243 


1097 


1538 


37 


444248.7.dec 


5070606H1 


393 


655 


40 


411396.24.dec 


2291135T6 


1097 


1501 


37 


444248.7.dec 


3798947H1 


14 


311 


40 


411396.24.dec 


g4649467 


1126 


1540 


37 


444248.7.dec 


1798841 H1 


47 


293 


40 


411396.24.dec 


g4 195074 


1132 


1541 


37 


444248.7.dec 


4301418H1 


19 


284 


40 


411396.24.dec 


24861 18H1 


1140 


1374 


37 


444248.7.dec 


4301419H1 


19 


281 


40 


411396.24.dec 


g2881577 


1155 


1538 


37 


444248.7.dec 


3115509H1 


18 


129 


40 


411396.24.dec 


g2401496 


1163 


1540 


38 


346599.9.dec 


3863294H1 


1 


253 


40 


411396.24.dec 


690555H1 


1167 


1259 


38 


346599.9.dec 


3746375H1 


1 


261 


40 


41 1396.24. dec 


g21 79339 


1181 


1541 


38 


346599.9.dec 


2514968F6 


6 


423 


40 


411396.24.dec 


261 1377H1 


1184 


1423 


38 


346599.9-dec 


2514968H1 


6 


168 


40 


411396.24.dec 


53371 42H1 


1184 


1440 


38 


346599.9.dec 


2308505H1 


17 


282 


40 


41 1396.24.dec 


1698746H1 


1189 


1386 


38 


346599.9.dec 


2308637H1 


17 


253 


40 


41 1396.24 .dec 


2563483H1 


1203 


1467 


38 


346599.9.dec 


3863323H1 


21 


284 


40 


411396.24.dec 


g 1891 890 


1233 


1541 


38 


346599.9.dec 


866960H1 


23 


277 


40 


411396.24.dec 


g3734944 


1235 


1538 


38 


346599.9.dec 


2561387H1 


22 


289 


40 


411396.24.dec 


715929H1 


1237 


1455 


38 


346599.9.dec 


2523135H1 


30 


276 


40 


41 1 396.24.dec 


5580235H2 


1242 


1505 


38 


346599.9.dec 


2518183H1 


35 


285 


40 


411396.24.dec 


854921 H1 


1246 


1508 


38 


346599.9.dec 


5027445H1 


54 


261 


40 


411396.24.dec 


g21 86094 


1247 


1542 


38 


346599.9.dec 


4155177H1 


73 


338 


40 


411396.24.dec 


g1 550426 


1274 


1538 


38 


346599.9.dec 


2561908H1 


123 


400 


40 


411396.24.dec 


g 1203733 


1277 


1539 


38 


346599.9.dec 


5020830H1 


129 


402 


40 


411396.24.dec 


g2016376 


1287 


1538 


38 


346599.9.dec 


2962428H1 


130 


392 


40 


411396.24.dec 


702514H1 


1302 


1540 


38 


346599.9.dec 


g4891117 


144 


397 


40 


411396.24.dec 


g2211872 


1321 


1546 


38 


346599.9.dec 


g5554144 


174 


385 


40 


411396.24.dec 


2753527H1 


1342 


1544 


38 


346599.9.dec 


51 56541 H1 


180 


428 


40 


411396.24.dec 


g 1203730 


1391 


1539 


38 


346599.9.dec 


1490922H1 


197 


417 


40 


411396.24.dec 


1326642H1 


1397 


1589 


38 


346599.9.dec 


2316064H1 


208 


472 


40 


411396.24.dec 


g5445022 


1403 


1605 


38 


346599.9.dec 


g922819 


212 


418 


40 


411396.24.dec 


3778081 H1 


1410 


1538 


38 


346599.9dec 


374753H1 


216 


423 


40 


411396.24.dec 


g3002301 


1444 


1538 


38 


346599.9.dec 


3424858H1 


228 


499 


40 


411396.24.dec 


g4 125232 


1471 


1538 


38 


346599.9.dec 


g2968918 


226 


418 


40 


411396.24.dec 


1640624H1 


1499 


1687 


38 


346599.9.dec 


g2985496 


249 


418 


40 


411396.24.dec 


1640663F6 


1499 


1687 


38 


346599.9.dec 


g 1063879 


264 


477 


40 


411396.24.dec 


4588787H1 


1499 


1690 


39 


480344 .2.dec 


180805H1 


1 


195 


40 


411396.24.dec 


1640663H1 


1498 


1687 


39 


480344 ,2.dec 


065943H1 


2 


232 


40 


411396.24.dec 


721969H1 


1499 


1613 


39 


480344 .2.dec 


064251 H1 


31 


213 


40 


411396.24.dec 


g2344373 


1 


388 


39 


480344.2 .dec 


071220H1 


36 


236 


40 


411396.24.dec 


4305707H1 


1 


285 


39 


480344.2.dec 


178296R6 


37 


272 


40 


411396.24.dec 


2500749H1 


32 


285 


39 


480344.2.dec 


g 190646 


53 


1555 


40 


411396.24.dec 


4107921 H1 


45 


240 


39 


480344.2.dec 


071770H1 


61 


290 


40 


411396.24.dec 


4529582H1 


46 


300 


39 


480344.2.dec 


g848952 


459 


801 


40 


411396.24.dec 


g2033066 


47 


345 


39 


. 480344.2-dec 


g850712 


476 


738 


40 


411396.24.dec 


4198728H1 


46 


344 


39 


480344.2.dec 


g853560 


509 


839 


40 


411396.24.dec 


4298375H1 


46 


273 


39 


480344.2.dec 


g748042 


511 


712 


40 


411396.24.dec 


2265581 H1 


46 


270 


39 


480344.2.dec 


g 1443831 


525 


745 


40 


411396.24.dec 


g 1971 997 


46 


294 


39 


480344.2.dec 


g848374 


526 


806 


40 


41 1396.24 .dec 


2847731 H1 


47 


302 


39 


480344.2.dec 


g611543 


705 


1064 


40 


411396.24.dec 


213859H1 


48 


199 


39 


480344.2.dec 


0621 74H1 


746 


947 


40 


411396.24.dec 


3246963H1 


49 


300 


39 


480344.2.dec 


063483H1 


746 


940 


40 


411396.24.dec 


4080908H1 


50 


328 


39 


480344.2 .dec 


063548H1 


747 


952 


40 


411396.24.dec 


74941 9H1 


52 


300 


39 


480344 .2.dec 


g839495 


882 


1094 


40 


411396.24.dec 


6027207H1 


52 


343 


39 


480344.2.dec 


g840023 


1252 


1556 


40 


411396.24.dec 


2076551 H1 


54 


263 


39 


480344.2.dec 


g850541 


1282 


1556 


40 


411396.24.dec 


g 1390868 


67 


384 


39 


480344.2.dec 


g650883 


1350 


1553 


40 


411396.24.dec 


g2955514 


69 


440 


39 


480344.2-dec 


g651032 


1396 


1553 


40 


411396.24.dec 


g4268541 


70 


434 


40 


411396.24.dec 


3469388H1 


959 


1205 


40 


411396.24.dec 


290851 7H1 


79 


270 


40 


411396.24.dec 


4032662H1 


973 


1232 


40 


411396.24.dec 


g21 97932 


78 


424 


40 


411396.24.dec 


2265224H1 


1000 


1236 


40 


411396.24.dec 


g5636493 


78 


387 


40 


411396.24.dec 


g1 527442 


1006 


1429 


40 


411396.24.dec 


g51 76779 


81 


393 


40 


411396.24.dec 


4583720H1 


1035 


1331 


40 


411396.24.dec 


g4892654 


83 


388 


40 


411396.24.dec 


4147459H1 


1050 


1276 


40 


411396.24.dec 


g3429293 


88 


431 


40 


411396.24.dec 


4548896T1 


1053 


1502 


40 


411396.24.dec 


g2198197 


89 


390 
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40 


411396.24.dec 


g 1784744 


100 


404 


41 


30281 9.4.dec 


2864086F6 


1037 


1444 


40 


411396.24.dec 


g 1390758 


103 


376 


41 


30281 9.4.dec 


2864086H1 


1037 


1322 


40 


411396.24.dec 


6518113H1 


108 


634 


41 


30281 9.4.dec 


g2835403 


1364 


1587 


40 


411396.24.dec 


2685370H1 


131 


374 


41 


30281 9.4.dec 


394461 2F6 


1376 


1887 


40 


411396.24.dec 


g31 49351 


136 


390 


41 


30281 9.4.dec 


394461 2H1 


1376 


1654 


40 


411396.24.dec 


3601094H1 


138 


383 


41 


30281 9.4.dec 


4311375F6 


1414 


1852 


40 


411396.24.dec 


g31 49496 


136 


388 


41 


30281 9.4.dec 


4311375H1 


1414 


1737 


40 


411396.24.dec 


369361 7H1 


182 


458 


41 


30281 9.4.dec 


1229839H1 


1577 


1800 


40 


411396.24.dec 


5292656H2 


206 


376 


41 


30281 9.4 .dec 


549751 6H1 


1627 


1809 


40 


411396.24.dec 


4743230H1 


209 


390 


41 


30281 9.4 .dec 


4178395H1 


1670 


1933 


40 


411396.24.dec 


g3069995 


216 


390 


41 


30281 9.4 .dec 


g575006 


1833 


2061 


40 


411396.24.dec 


3143092H1 


233 


432 


41 


30281 9.4.dec 


6382656H1 


1834 


2140 


40 


411396.24.dec 


4369528H1 


251 


511 


41 


30281 9.4.dec 


21 52931 H1 


1941 


2197 


40 


411396.24.dec 


4335203H1 


278 


568 


41 


30281 9.4.dec 


g791921 


1957 


2224 


40 


411396.24.dec 


1871893H1 


280 


537 


41 


30281 9.4.dec 


4785922H1 


1958 


2159 


40 


411396.24.dec 


g21 86404 


287 


666 


41 


30281 9.4.dec 


g953672 


1966 


2287 


40 


411396.24.dec 


g21 84660 


287 


665 


41 


30281 9.4.dec 


g1 987264 


1968 


2265 


40 


411396.24.dec 


1947248H1 


306 


548 


41 


30281 9.4.dec 


5449040H1 


2016 


2260 


40 


41 1396.24 .dec 


4643925H1 


341 


608 


41 


30281 9.4.dec 


5449083H1 


2016 


2294 


40 


411396.24.dec 


g1 40021 6 


348 


424 


41 


30281 9.4.dec 


4017288F6 


2099 


2454 


40 


411396.24.dec 


2991447H1 


485 


793 


41 


30281 9.4.dec 


4017288H1 


2099 


2257 


40 


411396.24.dec 


g2930061 


485 


954 


41 


30281 9.4.dec 


648666H1 


2106 


2332 


40 


411396.24.dec 


174444H1 


541 


753 


41 


30281 9.4.dec 


1614398H1 


2139 


2315 


40 


411396.24.dec 


174444R6 


541 


967 


41 


30281 9.4.dec 


4244096F8 


2141 


2441 


40 


411396.24.dec 


173477H1 


541 


733 


41 


30281 9.4.dec 


4754748H1 


2165 


2447 


40 


411396.24.dec 


4717029H1 


564 


805 


41 


30281 9.4.dec 


5268831 H1 


2194 


2380 


40 


411396.24.dec 


205391 9H1 


570 


827 


41 


30281 9.4.dec 


5857347H1 


2203 


2468 


40 


411396.24.dec 


1565945H1 


571 


777 


41 


30281 9.4.dec 


5835683H1 


2204 


2369 


40 


411396.24.dec 


34361 25H1 


587 


814 


41 


30281 9.4 .dec 


4330775H1 


2205 


2459 


40 


411396.24.dec 


4057202H1 


588 


859 


41 


30281 9.4.dec 


1288189F6 


2275 


2788 


40 


411396.24.dec 


5016627H1 


594 


876 


41 


30281 9.4.dec 


1288189H1 


2275 


2523 


40 


411396.24.dec 


1007141 H1 


602 


866 


41 


30281 9.4.dec 


g1 626001 


2286 


2647 


40 


411396.24.dec 


5594568H1 


608 


843 


41 


302819.4.dec 


5113638H1 


2298 


2483 


40 


411396.24.dec 


6523363H1 


617 


1131 


41 


302819.4.dec 


g991068 


2299 


2451 


40 


411396.24.dec 


066953H1 


636 


821 


41 


30281 9.4.dec 


5404686H1 


2431 


2650 


40 


411396.24.dec 


6559965H1 


637 


1204 


41 


30281 9.4.dec 


2705579H1 


2439 


2713 


40 


411396.24.dec 


2904638H1 


676 


965 


41 


30281 9.4.dec 


469051 3H1 


2481 


2679 


40 


411396.24.dec 


g1 891 889 


679 


951 


41 


30281 9.4.dec 


1848103H1 


2539 


2776 


40 


411396.24.dec 


1689134H1 


737 


976 


41 


30281 9.4.dec 


4180730H1 


2557 


2744 


40 


411396.24.dec 


594681 6H1 


809 


1086 


41 


30281 9.4.dec 


6409441 H1 


2559 


3089 


40 


411396.24.dec 


5550984H1 


823 


1043 


41 


30281 9.4.dec 


256921 3H1 


2631 


2873 


40 


411396.24.dec 


2428835H1 


843 


1068 


41 


30281 9.4.dec 


4145445H1 


2641 


2924 


40 


411396.24.dec 


6408161 H1 


872 


1145 


41 


30281 9.4.dec 


6269094H1 


2664 


3208 


40 


411396.24.dec 


g 1238949 


917 


1197 


41 


30281 9.4.dec 


394461 2T6 


2688 


3284 


40 


411396.24.dec 


g1 238951 


917 


1126 


41 


30281 9.4.dec 


g1 88541 5 


2709 


3137 


40 


411396.24.dec 


63451 72H1 


923 


1224 


41 


30281 9.4.dec 


g1 295339 


2731 


3182 


40 


41 1396.24 .dec 


2291135R6 


924 


1299 


41 


302819.4.dec 


1288189T6 


2747 


3292 


40 


411396.24.dec 


2291135H1 


924 


1145 


41 


302819.4.dec 


5845876H1 


2757 


2850 


40 


411396.24.dec 


3716726H1 


933 


1247 


41 


30281 9.4.dec 


3762741 H1 


2773 


3061 


40 


411396.24.dec 


161091H1 


943 


1140 


41 


30281 9.4.dec 


2426329H1 


2777 


3006 


40 


411396.24.dec 


5210085H1 


951 


1166 


41 


302819.4.dec 


4017288T6 


2797 


3294 


41 


302819.4.dec 


41 01 941 H1 


1 


186 


41 


30281 9.4.dec 


4310250H1 


2803 


3112 


41 


30281 9.4.dec 


g4107114 


21 


1636 


41 


30281 9.4.dec 


4310267H1 


2803 


3114 


41 


30281 9.4.dec 


g4589481 


50 


3333 


41 


30281 9.4.dec 


4942836H1 


2806 


3077 


41 


30281 9.4.dec 


6061022H1 


218 


755 


41 


30281 9.4.dec 


4203668H1 


2812 


2967 


41 


30281 9.4.dec 


5481035H1 


281 


557 


41 


30281 9.4 .dec 


6296936H1 


2850 


3171 


41 


30281 9.4.dec 


5481 071 H1 


281 


553 


41 


30281 9.4.dec 


1742377H1 


2853 


3141 


41 


30281 9.4.dec 


5478467H1 


281 


522 


41 


302819.4.dec 


4311375T6 


2858 


3290 


41 


30281 9.4 .dec 


5476596H1 


281 


512 


41 


30281 9. 4. dec 


g31 54601 


2859 


3332 


41 


30281 9.4.dec 


548121 9H1 


281 


450 


41 


30281 9.4.dec 


5478457H1 


2866 


3088 


41 


30281 9.4.dec 


5692922H1 


411 


577 


41 


302819.4.dec 


5482073H1 


2867 


3089 


41 


302819.4.dec 


5091536H1 


826 


909 


41 


30281 9.4.dec 


5482223H1 


2867 


3094 


41 


30281 9.4.dec 


5093136H1 


825 


1088 


41 


30281 9.4.dec 


5480273H1 


2867 


3040 


41 


30281 9.4.dec 


471 0301 H1 


856 


1147 


41 


30281 9.4.dec 


5480023H1 


2867 


2957 


41 


30281 9.4.dec 


3330968H1 


938 


1192 


41 


30281 9.4 .dec 


g4373364 


2892 


3335 


41 


30281 9.4.dec 


6366367H1 


954 


1220 


41 


30281 9.4.dec 


g3411895 


2897 


3331 


41 


30281 9.4.dec 


5844722H1 


970 


1252 


41 


30281 9.4.dec 


21 51 761 H1 


2904 


3168 
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41 


30281 9.4.dec 


2157650F6 


2912 


3333 


43 


399525.3.dec 


180563R1 


680 


1157 


41 


30281 9.4.dec 


2157650H1 


2912 


3026 


43 


399525.3.dec 


180563H1 


680 


997 


41 


30281 9.4.dec 


5408984H1 


2915 


3123 


43 


399525.3.dec 


180563R6 


680 


1075 


41 


30281 9.4.dec 


2312364H1 


2925 


3005 


43 


399525.3.dec 


2009268H1 


1284 


1486 


41 


302819.4.dec 


g 1625899 


2955 


3332 


43 


399525.3.dec 


1 629941 H1 


1293 


1491 


41 


30281 9.4.dec 


g3400238 


2954 


3343 


43 


399525.3.dec 


180563T6 


1341 


1687 


41 


30281 9.4.dec 


g4687972 


3018 


3337 


43 


399525.3.dec 


5469050H1 


1371 


1623 


41 


302819 4 dec 


1940575H1 


3026 


3227 


43 


399525.3.dec 


5469049H1 


1371 


1531 


41 


302819 4 dec 


2328673H1 


3039 


3265 


43 


399525.3.dec 


5674214H1 


1077 


1344 


41 


302819 4 dec 


g4300411 


3043 


3335 


43 


399525.3.dec 


g828577 


1126 


1362 


41 


30281 9.4.dec 


g4267943 


3046 


3336 


43 


399525.3.dec 


180563F1 


1185 


1723 


41 


302819 4 dec 


q825048 


3051 


3341 


43 


399525.3.dec 


1267377T6 


1250 


1684 


41 


302819 4 dec 


q991036 


3052 


3331 


43 


399525.3.dec 


1267377F6 


102 


584 


41 


30281 9. 4. dec 


g567561 


3097 


3334 


43 


399525.3.dec 


1267377F1 


102 


515 


41 


302819.4 dec 


g1 898761 


3121 


3336 


43 


399525.3.dec 


g2358795 


127 


184 


41 


302819 4. dec 


q953400 


3215 


3338 


43 


399525.3.dec 


5533757H1 


452 


689 


41 


302819 4 dec 


171 2751 F6 


3235 


3334 


43 


399525.3.dec 


5521485H1 


499 


718 


41 


302819 4 dec 


1712751H1 


3235 


3334 


43 


399525.3.dec 


5624724H1 


519 


815 


41 


302819 4 dec 


171 2751 T6 


3239 


3296 


43 


399525.3.dec 


1267377H1 


102 


361 


41 


302819 4 dec 


2107049H1 


3257 


3334 


43 


399525.3.dec 


5173655H1 


651 


908 


41 


302819 4 dec 


g4532014 


2983 


3333 


44 


222795.6.dec 


g5231568 


1 


380 


41 


302819 4 dec 


5839592H1 


2990 


3212 


44 


222795.6.dec 


g4438873 


1 


420 


41 


302819 4.dec 


g4685396 


3018 


3337 


44 


222795.6.dec 


6478906H1 


1 


373 


41 


302819.4.dec 


g4299064 


2964 


3333 


44 


222795.6.dec 


g4888931 


8 


353 


41 


302819.4-dec 


g4599081 


2965 


3335 


44 


222795.6.dec 


g51 78893 


8 


400 


41 


30281 9.4.dec 


g 1265092 


2965 


3331 


44 


222795.6.dec 


g4301971 


8 


432 


41 


30281 9.4.dec 


240555H1 


2977 


3157 


44 


222795.6.dec 


g4436204 


8 


328 


42 


238734 ,2.dec 


g2307091 


1 


691 


44 


222795.6.dec 


g3401868 


8 


316 


42 


238734.2.dec 


6507926H1 


1 


352 


44 


222795.6.dec 


g41 96406 


8 


271 


42 


238734 .2.dec 


4447563H1 


19 


290 


44 


222795.6.dec 


3404674H1 


17 


79 


42 


238734 .2.dec 


2027993H1 


168 


440 


44 


222795.6.dec 


35951 84H1 


28 


317 


42 


238734 ,2.dec 


2027993R6 


168 


644 


44 


222795.6.dec 


1616015H1 


62 


278 


42 


238734.2.dec 


6075427H1 


277 


593 


44 


222795.6.dec 


1616788F6 


62 


384 


42 


238734 .2.dec 


1403261 H1 


544 


797 


44 


222795.6.dec 


2642562H1 


85 


331 


42 


238734 .2.dec 


1 403261 F6 


544 


930 


44 


222795.6.dec 


2642562F6 


85 


248 


42 


238734.2.dec 


6454693H1 


693 


1320 


44 


222795.6.dec 


3935234H1 


146 


420 


42 


238734. 2. dec 


6512287H1 


1018 


1563 


44 


222795.6.dec 


58621 38H1 


160 


420 


42 


238734 2 dec 


5868348H1 


1160 


1439 


44 


222795.6.dec 


5636483H1 


183 


432 


42 


238734 .2. dec 


5868348F6 


1159 


1660 


44 


222795.6.dec 


5517092H1 


182 


390 


42 


238734 2 dec 


9021 43R1 


1458 


1956 


44 


222795.6.dec 


6404623H1 


395 


606 


42 


238734. 2.dec 


9021 43H1 


1458 


1736 


44 


222795.6.dec 


5349471 F6 


186 


639 


42 


238734 .2. dec 


1 40326 1T6 


1643 


2240 


44 


222795.6.dec 


5349471 H1 


186 


436 


42 


238734 ,2.dec 


531505T6 


1705 


2112 


44 


222795.6.dec 


6323776H1 


372 


571 


42 


238734. 2. dec 


1511070H1 


1830 


2037 


44 


222795.6.dec 


37481 45H1 


378 


622 


42 


238734.2.dec 


1511070F6 


1830 


2183 


44 


222795.6.dec 


6552507H1 


590 


701 


42 


238734 ,2.dec 


25691 10H1 


1893 


2154 


44 


??2795.6.dec 


6564675H1 


592 


779 


42 


238734 .2.dec 


2568849H1 


1893 


2154 


44 


222795.6.dec 


6181144H1 


625 


910 


42 


238734.2.dec 


6260538H1 


1963 


2248 


44 


222795.6.dec 


3519117H1 


705 


976 


43 


399525.3.dec 


6120694H1 


754 


1295 


44 


222795.6.dec 


64291 78H1 


726 


877 


43 


399525.3.dec 


2468280H1 


710 


947 


44 


222795.6.dec 


3774768H1 


893 


1190 


43 


399525.3.dec 


484451 9H1 


1 


269 


44 


222795.6.dec 


2214563H1 


1114 


1383 


43 


399525.3.dec 


3747370H1 


37 


337 


45 


410628.5.dec 


5469732H1 


1 


256 


43 


399525.3.dec 


341 021 H1 


59 


272 


45 


410628.5.dec 


180125H1 


2 


220 


43 


399525.3.dec 


5983547H1 


78 


335 


45 


410628.5.dec 


g2008865 


154 


461 


43 


399525.3.dec 


55221 63H1 


88 


326 


45 


410628.5.dec 


3364980H1 


185 


440 


43 


399525.3.dec 


492858H1 


858 


1088 


45 


41Q628.5.dec 


5912578H1 


239 


513 


43 


399525.3.dec 


2928668F6 


877 


1159 


45 


410628.5-dec 


2829339F6 


378 


776 


43 


399525.3.dec 


2928668H1 


878 


1163 


45 


410628.5.dec 


2829339H1 


378 


642 


43 


399525.3.dec 


42891 36H1 


879 


1140 


45 


410628.5.dec 


3807945H1 


389 


692 


43 


399525.3.dec 


554461 2H2 


893 


970 


45 


410628.5.dec 


4008525H1 


639 


896 


43 


399525.3.dec 


405758H1 


900 


1121 


45 


410628.5.dec 


1852093F6 


707 


1214 


43 


399525.3.dec 


g1 685821 


910 


1234 


45 


410628.5.dec 


1852093H1 


707 


974 


43 


399525.3.dec 


3034576F6 


950 


1476 


45 


410628.5.dec 


2068781 H1 


727 


1017 


43 


399525.3.dec 


3034576F7 


950 


1418 


45 


410628.5.dec 


2131281H1 


727 


959 


43 


399525.3.dec 


3034576H1 


951 


1250 


45 


410628-5.dec 


3483874H1 


749 


1023 


43 


399525.3.dec 


6157370H1 


985 


1233 


45 


410628.5.dec 


1956841 H1 


759 


1021 


43 


399525.3.dec 


6588995H1 


1047 


1558 


45 


410628.5.dec 


2928243H1 


778 


1037 
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45 


410628.5.dec 


3893475H1 


908 


1148 


46 


053649.6. dec 


2907550H1 


4841 


5144 


45 


410628.5.dec 


077976H1 


909 


1059 


46 


053649.6.dec 


6349673H2 


4841 


5135 


45 


410628.5.dec 


2187402H1 


927 


1197 


46 


053649.6.dec 


3933835H1 


4934 


5204 


45 


410628.5.dec 


g1 63941 8 


934 


1256 


46 


053649.6.dec 


3685046H1 


4934 


5172 


45 


410628.5.dec 


1910319H1 


955 


1197 


46 


053649.6.dec 


4312063H1 


4955 


5233 


45 


410628.5.dec 


2433701 H1 


1023 


1250 


46 


053649.6.dec 


1582279H1 


4957 


5132 


45 


410628.5.dec 


1688079H1 


1048 


1249 


46 


053649.6.dec 


g3988652 


4982 


5454 


45 


410628.5.dec 


2928651 H1 


1050 


1142 


46 


053649.6.dec 


795706H1 


4990 


5228 


45 


410628 5 dec 


5856935H1 


1088 


1381 


46 


053649.6.dec 


1871289T6 


5005 


5412 


45 


41 0628 5 dec 


6045234H1 


1118 


1591 


46 


053649.6.dec 


4433271 H1 


5023 


5281 


45 


410628 5 dec 


6324874H1 


1118 


1176 


46 


053649.6.dec 


1947703H1 


5099 


5298 


45 


410628 5 dec 


a 1638026 


1130 


1432 


46 


053649.6.dec 


g4763477 


5110 


5454 


45 


41 0628 5 dec 


1907874H1 


1152 


1418 


46 


053649.6.dec 


1876550H1 


5134 


5420 


45 


410628 5 dec 


1964088R6 


1200 


1575 


46 


053649. 6. dec 


2427603H1 


5155 


5410 


45 


410628 5 dec 


1964088H1 


1200 


1470 


46 


053649.6.dec 


g3848344 


5168 


5454 


45 


410628 5 dec 


1352433H1 


1201 


1414 


46 


053649.6.dec 


5274403H1 


5182 


5436 


45 


410628 5 dec 


6045234 J 1 


1205 


1805 


46 


053649.6.dec 


2441 651 H1 


5200 


5424 


45 


410628 5 dec 


3752470H1 


1214 


1507 


46 


053649.6.dec 


45401 6R6 


5209 


5689 


45 


410628.5.dec 


4720194H1 


1231 


1485 


46 


053649.6.dec 


45401 6H1 


5209 


5432 


45 


41 0628 5.dec 


2829339T6 


1238 


1808 


46 


053649.6.dec 


3975767H1 


5228 


5505 


45 


410628 5 dec 


591385H1 


1269 


1495 


46 


053649.6.dec 


g5526998 


5242 


5456 


45 


410628.5.dec 


376651 3H1 


1286 


1580 


46 


053649.6.dec 


27771 35H1 


5244 


5456 


45 


410628.5.dec 


2674849H1 


1347 


1596 


46 


053649.6.dec 


g1 266554 


5245 


5526 


45 


410628.5.dec 


326691 8H1 


1373 


1653 


46 


053649.6.dec 


6209668H1 


5299 


5610 


45 


410628.5.dec 


g5232075 


1373 


1840 


46 


053649.6.dec 


282574H1 


5302 


5453 


45 


410628.5.dec 


3412689H1 


1374 


1617 


46 


053649.6.dec 


528280H1 


5344 


5630 


45 


410628.5.dec 


4502789H1 


1377 


1571 


46 


053649.6.dec 


g1 623674 


5351 


5653 


45 


410628.5.dec 


396281 8H1 


1378 


1675 


46 


053649.6.dec 


g2023447 


5386 


5614 


45 


410628.5.dec 


g2322969 


1399 


1831 


46 


053649.6.dec 


4996368H1 


5418 


5688 


45 


410628.5.dec 


g2432892 


1455 


1829 


46 


053649.6.dec 


4873622H1 


5441 


5700 


45 


410628.5.dec 


1964088T6 


1485 


1822 


46 


053649.6.dec 


1918585H1 


5462 


5730 


45 


410628.5.dec 


g4734424 


1491 


1942 


46 


053649.6.dec 


394998R6 


5483 


5942 


45 


410628.5.dec 


1710011T6 


1493 


1815 


46 


053649.6.dec 


2608003H1 


5528 


5778 


45 


410628.5.dec 


g1 9551 37 


1511 


1816 


46 


053649.6. dec 


286921 3H1 


5536 


5749 


45 


410628.5.dec 


2604776F6 


1522 


2027 


46 


053649.6.dec 


2846562H1 


5536 


5809 


45 


410628.5.dec 


2604776H1 


1522 


1763 


46 


053649.6.dec 


3549988H1 


5536 


5775 


45 


410628.5.dec 


563641 1H1 


1523 


1783 


46 


053649.6.dec 


2878203H1 


5538 


5804 


45 


410628.5-dec 


g2736627 


1555 


1948 


46 


053649.6.dec 


038436H1 


5551 


5798 


45 


410628.5.dec 


g5632819 


1568 


1884 


46 


053649.6.dec 


2859240H1 


5556 


5812 


45 


41 0628.5. dec 


373891 2H1 


1574 


1881 


46 


053649.6.dec 


447581 H1 


5563 


5777 


45 


410628.5.dec 


g4331385 


1574 


1859 


46 


053649.6.dec 


6543240H1 


5619 


6031 


45 


410628 5 dec 


g2002540 


1592 


1837 


46 


053649.6.dec 


1924182R6 


5631 


5923 


45 


410628.5.dec 


g2255929 


1638 


1837 


46 


053649.6.dec 


1924182H1 


5631 


5853 


45 


410628 5.dec 


g43 10985 


1641 


2104 


46 


053649.6.dec 


45401 6T6 


5637 


6309 


45 


410628.5.dec 


g1 9551 38 


1643 


1883 


46 


053649.6.dec 


2857928H1 


5651 


5916 


45 


410628.5.dec 


4085875H1 


1679 


1943 


46 


053649.6.dec 


6518276H1 


5654 


6091 


45 


410628.5.dec 


g4703429 


1840 


2283 


46 


053649.6.dec 


3093841 H1 


5656 


5933 


46 


053649.6.dec 


g2140414 


4715 


5138 


46 


053649.6.dec 


g2155719 


5667 


5875 


46 


053649.6.dec 


g2752467 


4717 


5141 


46 


053649.6.dec 


1924182T6 


5689 


6258 


46 


053649.6.dec 


g3738044 


4718 


5139 


46 


053649.6.dec 


311 6771 H1 


5729 


6000 


46 


053649.6.dec 


4823575H1 


4717 


4978 


46 


053649.6.dec 


3510740T6 


5789 


6255 


46 


053649 6.dec 


6158080H1 


4721 


4829 


46 


053649.6.dec 


2357646F6 


5787 


6264 


46 


053649.6.dec 


g4607123 


4728 


5134 


46 


053649.6.dec 


2357646H1 


5787 


6011 


46 


053649.6.dec 


3535641 H1 


4740 


5025 


46 


053649.6.dec 


018607H1 


5789 


6079 


46 


053649.6.dec 


604476H1 


4749 


4991 


46 


053649.6.dec 


017537H1 


5789 


6050 


46 


053649.6.dec 


6073 12H1 


4750 


4997 


46 


053649.6.dec 


015937H1 


5789 


6038 


46 


053649.6.dec 


2865773H1 


4777 


5022 


46 


053649.6.dec 


016285H1 


5789 


6052 


46 


053649.6.dec 


g2877086 


4781 


5141 


46 


053649.6.dec 


621377R6 


5821 


6295 


46 


053649.6.dec 


1 562789T6 


4780 


5129 


46 


053649.6.dec 


621377T6 


5821 


6258 


46 


053649.6.dec 


1399654H1 


4786 


5034 


46 


053649.6.dec 


621377H1 


5821 


6083 


46 


053649.6.dec 


g1 957074 


4788 


5061 


46 


053649.6.dec 


g21 97952 


5853 


6298 


46 


053649.6.dec 


3039382H1 


4795 


5050 


46 


053649.6.dec 


g 1859838 


5860 


6289 


46 


053649.6.dec 


g4329155 


4800 


5133 


46 


053649.6.dec 


g4982575 


5873 


6297 


46 


053649.6.dec 


311 6891 H1 


4813 


5054 


46 


053649.6.dec 


g5632069 


5874 


6303 


46 


053649.6.dec 


2351206H1 


4819 


5029 


46 


053649.6.dec 


3331637H1 


5875 


6123 


46 


053649.6.dec 


4593993H1 


4835 


5096 


46 


053649.6.dec 


g39 17437 


5878 


6304 


46 


053649.6.dec 


4618056H1 


4835 


5084 


46 


053649.6.dec 


33201 10H1 


5881 


6166 
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46 


053649.6.dec 


470544H1 


5881 


6131 


46 


053649.6-dec 


22851 80H1 


4464 


4717 


46 


053649.6-dec 


g4260778 


5892 


6346 


46 


053649.6.dec 


22851 80R6 


4465 


4934 


46 


053649-6.dec 


061289H1 


5901 


6055 


46 


053649.6-dec 


4381173H1 


4489 


4762 


46 


053649.6.dec 


g5526193 


5903 


6303 


46 


053649.6.dec 


3071901 H1 


4490 


4775 


46 


053649.6.dec 


3200859H1 


5937 


6216 


46 


053649.6-dec 


2246853H1 


4497 


4749 


46 


053649.6.dec 


2463691 H1 


5960 


6177 


46 


053649.6.dec 


2519418H1 


4503 


4759 


46 


053649.6-dec 


2378781 H1 


5963 


6184 


46 


053649.6-dec 


2589642H1 


4503 


4747 


46 


053649.6.dec 


849937H1 


6881 


7000 


46 


053649.6.dec 


1684996H1 


4503 


4687 


46 


053649.6-dec 


6396030H1 


6458 


6756 


46 


053649.6.dec 


6109482H1 


4561 


4852 


46 


053649.6-dec 


g3094377 


6488 


6953 


46 


053649.6.dec 


1509877H1 


4561 


4755 


46 


053649.6-dec 


g4606923 


6500 


6957 


46 


053649.6.dec 


5091092H1 


4588 


4843 


46 


053649.6.dec 


g751556 


6503 


6702 


46 


053649.6.dec 


50741 94H1 


4621 


4900 


46 


053649.6.dec 


g4291182 


6509 


6959 


46 


053649.6-dec 


3770734H1 


4644 


4927 


46 


053649.6-dec 


g3849739 


6509 


6957 


46 


053649.6.dec 


655534H1 


4649 


4876 


46 


053649.6-dec 


g5671130 


6517 


6968 


46 


053649.6.dec 


3865590H1 


4643 


4839 


46 


053649.6-dec 


g3764849 


6530 


6954 


46 


053649.6.dec 


2751219H1 


6404 


6675 


46 


053649.6-dec 


2230287H1 


6551 


6799 


46 


053649.6.dec 


1463185H1 


6085 


6262 


46 


053649.6.dec 


g1218106 


6563 


6953 


46 


053649.6-dec 


1463209T6 


6085 


6259 


46 


053649.6.dec 


g3756320 


6573 


6959 


46 


053649.6.dec 


447899H1 


6097 


6164 


46 


053649.6.dec 


g2188518 


6591 


6910 


46 


053649.6. dec 


394998T6 


6121 


6634 


46 


053649.6.dec 


4000865H1 


6644 


6917 


46 


053649.6.dec 


126467H1 


6122 


6390 


46 


053649.6.dec 


g2270783 


6663 


6971 


46 


053649.6.dec 


2603006H1 


6180 


6456 


46 


053649. 6.dec 


570981 6H1 


6073 


6342 


46 


053649.6-dec 


582896H1 


6194 


6440 


46 


053649.6-dec 


2469521 H1 


6085 


6306 


46 


053649.6-dec 


36301 89H1 


6199 


6389 


46 


053649 .6-dec 


42561 23H1 


5968 


6240 


46 


053649.6.dec 


4629688H1 


6205 


6357 


46 


053649.6-dec 


4245703H1 


5978 


6255 


46 


053649.6-dec 


g4326882 


6226 


6674 


46 


053649.6-dec 


g5446227 


5984 


6348 


46 


053649.6-dec 


g4852252 


6234 


6680 


46 


053649.6.dec 


4091718H1 


6032 


6256 


46 


053649.6-dec 


g4074077 


6250 


6679 


46 


053649.6.dec 


2357646T6 


6065 


6299 


46 


053649.6.dec 


474451 0H1 


6253 


6495 


46 


053649.6-dec 


g 1778032 


1 


6659 


46 


053649.6.dec 


2602237F6 


6709 


6953 


46 


053649.6-dec 


1271488H1 


270 


501 


46 


053649.6.dec 


2602237H1 


6709 


6980 


46 


053649.6.dec 


61 77281 H1 


282 


549 


46 


053649.6-dec 


2602237T6 


6710 


6915 


46 


053649.6.dec 


4535853H1 


1492 


1746 


46 


053649.6.dec 


505561 H1 


6772 


6953 


46 


053649.6-dec 


2998346H1 


1576 


1847 


46 


053649.6.dec 


1463209R6 


6085 


6297 


46 


053649 .6.dec 


183990H1 


1784 


1957 


46 


053649.6.dec 


g3770766 


6708 


6962 


46 


053649.6.dec 


2416184H1 


1894 


2137 


47 


221914.2.dec 


2692351 H1 


66 


317 


46 


053649.6-dec 


553761 4H1 


2185 


2376 


47 


221914.2.dec 


3324937H1 


40 


301 


46 


053649.6.dec 


g2817443 


2299 


2770 


47 


221914.2.dec 


1510086H1 


42 


260 


46 


053649.6-dec 


2015073H1 


2315 


2562 


47 


221914.2.dec 


1521573H1 


47 


236 


46 


053649. 6.dec 


g3644396 


2346 


2771 


47 


221914.2.dec 


340241 1H1 


65 


235 


46 


053649. 6. dec 


g4270510 


2435 


2770 


47 


221914.2.dec 


2991821 H1 


205 


426 


46 


053649.6.dec 


5080263H1 


2919 


3138 


47 


221914.2.dec 


3417004H1 


240 


480 


46 


053649.6.dec 


4872695H1 


2991 


3250 


47 


221914.2.dec 


3393041 H1 


275 


543 


46 


053649.6.dec 


4398050H1 


3001 


3257 


47 


221914.2.dec 


g3405497 


405 


701 


46 


053649. 6.dec 


2723988H1 


3048 


3289 


47 


221914.2.dec 


4515647H1 


468 


682 


46 


053649.6-dec 


2836887H1 


3186 


3441 


47 


221914.2.dec 


4121840H1 


504 


719 


46 


053649. 6.dec 


3901067H1 


3335 


3594 


47 


221914.2.dec 


3422922H1 


504 


728 


46 


053649.6-dec 


3510740F6 


3517 


3957 


47 


221914.2.dec 


4514527H1 


516 


761 


46 


053649.6.dec 


3510740H1 


3517 


3786 


47 


221914.2.dec 


1 95781 2H1 


608 


883 


46 


053649. 6.dec 


3340558H1 


3636 


3901 


47 


221914.2.dec 


2137153F6 


1 


168 


46 


053649.6.dec 


4347614H1 


3688 


3944 


47 


221914.2.dec 


2137153H1 


1 


236 


46 


053649.6.dec 


1562789F6 


3799 


4220 


47 


221914.2.dec 


3615140H1 


6 


287 


46 


053649.6-dec 


1 562781 H1 


3799 


4019 


47 


221914.2.dec 


2017382H1 


17 


145 


46 


053649.6-dec 


1562789H1 


3799 


4021 


47 


221914.2-dec 


3508367H1 


17 


116 


46 


053649.6.dec 


2431585H1 


3818 


4046 


47 


221914.2.dec 


3748348H1 


17 


188 


46 


053649.6-dec 


5408836H1 


3906 


4164 


47 


221914.2-dec 


1491036H1 


37 


210 


46 


053649 .6.dec 


4350359H1 


3911 


4166 


47 


221914.2.dec 


2520837H1 


38 


265 


46 


053649.6.dec 


881810H1 


3939 


4180 


47 


221914.2.dec 


1 95781 2F6 


608 


1033 


46 


053649.6-dec 


640841 2H1 


3945 


4490 


47 


221914.2.dec 


1871603H1 


706 


831 


46 


053649.6-dec 


6454776H1 


3990 


4455 


47 


221914.2.dec 


3939407H1 


821 


925 


46 


053649.6.dec 


1708247H1 


4017 


4242 


48 


347748.2.dec 


3272638H1 


1 


235 


46 


053649.6.dec 


1376453H1 


4018 


4238 


48 


347748.2.dec 


3568428H1 


5 


318 


46 


053649.6-dec 


2917695H1 


4120 


4396 


48 


347748.2.dec 


1591195H1 


17 


220 


46 


053649.6-dec 


491 3391 H1 


4123 


4316 


48 


347748.2.dec 


1591 381 H1 


17 


242 


46 


053649.6-dec 


2433842H1 


4141 


4349 


48 


347748.2.dec 


3108209H1 


18 


112 


46 


053649.6.dec 


6477331 H1 


4227 


4751 


48 


347748.2.dec 


63691 07H1 


24 


471 


46 


053649.6.dec 


22851 80T6 


4466 


5094 


48 


347748.2.dec 


3436985H1 


25 


112 
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48 


347748.2.dec 


3472447H1 


31 


267 ' 


49 


401482.2.oct 


436341 2H1 


140 


231 


48 


347748.2.dec 


3271083H1 


171 


402 


49 


401482.2.oct 


592882H1 


140 


262 


48 


347748.2.dec 


4181728H1 


195 


281 


49 


401482.2.oct 


4128816H1 


140 


306 


48 


347748.2.dec 


6477761 H1 


229 


407 


49 


401482.2.oct 


5066972H1 


141 


237 


48 


347748.2.dec 


g3778189 


312 


759 


49 


401482.2.oct 


63751 87H1 


140 


221 


48 


347748.2.dec 


g31 79263 


312 


497 


49 


401482.2.oct 


4057141 H1 


140 


268 


48 


347748.2.dec 


g3050805 


314 


558 


49 


401482.2.oct 


368551 H1 


143 


294 


48 


347748.2.dec 


g31 74659 


315 


812 


49 


401482.2.oct 


5262823H1 


140 * 


292 


48 


347748.2.dec 


512461 H1 


338 


542 


49 


401482.2.oct 


4008852H1 


140 


272 


48 


347748.2.dec 


51 2461 R6 


338 


686 


49 


401482.2.OCt 


4563645H1 


140 


220 


48 


347748.2.dec 


446066H1 


338 


579 


49 


401482.2.oct 


4982802H1 


148 


301 


48 


347748.2.dec 


442400H1 


339 


579 


49 


401482.2.oct 


3490679H1 


140 


303 


48 


347748. 2. dec 


3097735H1 


484 


712 


49 


401482.2.oct 


3142629H1 


143 


265 


48 


347748.2.dec 


3540123H1 


509 


787 


49 


401482.2.oct 


820429H1 


142 


238 


48 


347748.2.dec 


3319530H1 


576 


637 


49 


401482.2.oct 


1268873H1 


140 


304 


48 


347748.2.dec 


g 1970848 


599 


927 


49 


401482.2.oct 


2839681 H2 


140 


269 


48 


347748.2.dec 


g 1970843 


599 


872 


49 


401482.2.OCt 


3865664H1 


141 


257 


48 


347748.2.dec 


g2000224 


609 


870 


49 


401482.2.oct 


4662861 H1 


141 


207 


48 


347748 .2.dec 


425094H1 


803 


951 


49 


401482.2.oct 


1708280H1 


136 


250 


48 


347748.2.dec 


424797H1 


803 


1010 


49 


401482.2.oct 


g 1983680 


136 


299 


48 


347748.2.dec 


4281 93H1 


803 


1019 


49 


401482.2.oct 


3897334H1 


135 


304 


48 


347748.2.dec 


25521 19H1 


853 


1114 


49 


401482.2.oct 


g1 99631 5 


135 


323 


48 


347748.2.dec 


5511770H1 


617 


861 


49 


401482.2.oct 


4743463H1 


135 


277 


48 


347748.2.dec 


6489362H1 


689 


1258 


49 


401482.2.oct 


3570680H1 


135 


274 


48 


347748.2.dec 


4674740H1 


690 


990 


49 


401482.2.oct 


3891485H1 


129 


293 


48 


347748.2.dec 


1405304H1 


972 


1249 


49 


401482.2.oct 


983595H1 


129 


239 


48 


347748.2.dec 


2868023H1 


999 


1231 


49 


401482.2.oct 


4956529H1 


129 


290 


48 


347748.2.dec 


g2011228 


715 


1019 


49 


401482.2.oct 


1681495H1 


129 


311 


48 


347748 .2 .dec 


4774949H1 


727 


871 


49 


401482.2.oct 


5376752H1 


124 


296 


48 


347748.2.dec 


50041 88H1 


791 


1031 


49 


401482.2.oct 


1308930H1 


129 


238 


48 


347748.2.dec 


2687450H1 


801 


1060 


49 


401482.2.oct 


3881848H1 


129 


332 


48 


347748.2.dec 


4281 93R6 


803 


1166 


49 


401482.2.oct 


1643328H1 


129 


311 


48 


347748.2.dec 


424588H1 


803 


871 


49 


401482.2.oct 


3927505H1 


130 


304 


48 


347748.2.dec 


425651 H1 


803 


1024 


49 


401482.2.oct 


5389939H1 


129 


296 


48 


347748.2.dec 


4281 93T6 


1121 


1733 


49 


401482.2.oct 


4612035H1 


124 


384 


48 


347748.2.dec 


5089180H1 


1169 


1430 


49 


401482.2.oct 


5687077H1 


124 


381 


48 


347748.2.dec 


2111317H1 


1292 


1552 


49 


401482.2.oct 


23781 81 H1 


127 


346 


49 


401482.2.oct 


338041 4H1 


141 


264 


49 


401482.2.oct 


2457930H1 


127 


280 


49 


401482.2.oct 


3806831 H1 


141 


260 


49 


401482.2.oct 


3889660H1 


130 


304 


49 


401482.2.oct 


3152624H1 


140 


323 


49 


401482.2.oct 


446081 1H1 


130 


243 


49 


401482.2.oct 


6008631 H1 


140 


311 


49 


401482.2.oct 


3878648H1 


130 


248 


49 


401482.2.OCt 


1599727H1 


140 


304 


49 


401482.2.oct 


868503H1 


129 


311 


49 


401482 2.oct 


4536938H1 


140 


258 


49 


401482.2.oct 


1642356H1 


129 


332 


49 


401482.2.0Ct 


3606772H1 


140 


252 


49 


401482.2.oct 


2739322H1 


129 


311 


49 


401482.2.oct 


4830281 H1 


141 


230 


49 


401482.2.oct 


2353257H1 


129 


281 


49 


401482.2.oct 


1308766H1 


140 


294 


49 


401482.2.oct 


5043438H1 


146 


271 


49 


401482.2.0Ct 


2666290H1 


140 


250 


49 


401482.2.oct 


1721132H1 


129 


311 


49 


401482.2.oct 


2072576H1 


140 


304 


49 


401482.2.oct 


194897H1 


129 


350 


49 


401482.2.0Ct 


4974839H1 


140 


304 


49 


401482.2.oct 


4528304H1 


129 


270 


49 


401482.2.oct 


3807429H1 


141 


304 


49 


401482.2.oct 


2859113H1 


129 


332 


49 


401482.2.oct 


2524335H1 


140 


311 


49 


401482.2.OCt 


3777804H1 


130 


269 


49 


401482.2.oct 


1527079H1 


140 


261 


49 


401482.2.oct 


2777981 H1 


129 


380 


49 


401482.2.oct 


137226H1 


140 


311 


49 


401482.2.oct 


1480087H1 


129 


311 


49 


401482.2.oct 


3081905H1 


140 


288 


49 


401482.2.oct 


814473H1 


129 


212 


49 


401482.2.oct 


228671 2H1 


140 


294 


49 


401482.2.OCt 


1375967H1 


129 


323 


49 


401482.2.oct 


2189156H1 


140 


248 


49 


401482.2.oct 


1556743H1 


129 


323 


49 


401482.2.oct 


1550788H1 


140 


304 


49 


401482.2.oct 


1912456H1 


129 


332 


49 


401482.2.oct 


2736933H1 


140 


230 


49 


401482.2.oct 


3861078H1 


129 


258 


49 


401482.2.oct 


592827H1 


140 


268 


49 


401482.2.oct 


371153H1 


129 


304 


49 


401482.2.oct 


2074671 H1 


140 


301 


49 


401482.2.oct 


3731 85H1 


129 


323 


49 


401482.2.oct 


6105031 H1 


140 


255 


49 


401482.2.oct 


4058394H1 


129 


235 


49 


401482.2.oct 


4563727H1 


140 


311 


49 


401482.2.oct 


116195H1 


129 


299 


49 


401482.2.oct 


2741386H1 


140 


266 


49 


401482.2.oct 


5106028H1 


131 


232 


49 


401482.2.oct 


3807627H1 


140 


265 


49 


401482.2.OC! 


4549024H1 


130 


304 


49 


401482.2.oct 


2854386H1 


140 


237 


49 


401482.2-OCt 


5861 54H1 


129 


371 


49 


401482.2.oct 


4833902H1 


140 


277 


49 


401482.2.oct 


2073746H1 


129 


219 


49 


401482.2.oct 


540768H1 


140 


311 


49 


401482.2.oct 


1281 13H1 


131 


245 
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49 


401482.2.OCt 


45851 85H1 


129 


364 


49 


401482.2.OC1 


g2325824 


1 


234 


49 


401482.2.oct 


1892064H1 


129 


304 


49 


401482.2.oct 


579481 0H1 


32 


300 


49 


401482.2.oct 


1625250H1 


129 


218 


49 


401482.2.oct 


5792265H1 


51 


300 


49 


401482.2.oct 


23571 89H1 


129 


261 


49 


401482.2.OCt 


5947232H1 


121 


353 


49 


401482.2.oct 


4630841 H1 


134 


272 


49 


401482.2.oct 


807770H1 


122 


224 


49 


401482.2.oct 


5699535H1 


129 


375 


49 


401482.2.oct 


5947296H1 


122 


223 


49 


401482.2.oct 


2071024H1 


129 


323 


49 


401482.2.oct 


3358464H1 


128 


419 


49 


401482.2.oct 


2808648H1 


127 


304 


49 


401482.2.oct 


g 161 4987 


147 


549 


49 


401482.2.oct 


4632985H1 


129 


385 


49 


401482.2.oct 


g1 61 4885 


235 


579 


49 


401482.2.oct 


94841 OH 1 


129 


311 


49 


401482.2.oct 


2096459H1 


132 


288 


49 


401482.2.oct 


26501 95H1 


129 


311 


49 


401482.2.oct 


5077509H1 


132 


311 


49 


401482.2.oct 


1911720H1 


129 


282 


49 


401482.2-oct 


3672962H1 


132 


267 


49 


401482.2.oct 


5557203H1 


129 


303 


49 


401482.2.oct 


2495269 H1 


133 


292 


49 


401482.2.oct 


51 87021 H1 


129 


280 


49 


401482.2.oct 


5582685H1 


133 


299 


49 


401482.2.oct 


4343951 H1 


129 


282 


49 


401482-2.oct 


2237565H1 


133 


231 


49 


401482.2.oct 


g 1734245 


129 


221 


49 


401482.2.oct 


440921 0H1 


133 


283 


49 


401482.2.oct 


g1 734234 


129 


221 


49 


401482.2.oct 


4455694H1 


133 


323 


49 


401482.2.oct 


45501 25H1 


130 


392 


49 


401482.2.oct 


082693H1 


133 


273 


49 


401482.2.oct 


2026970H1 


149 


204 


49 


401482.2.oct 


1738401 H1 


133 


244 


49 


401482.2.OCt 


3056876H1 


129 


339 


49 


401482.2.oct 


g1816100 


133 


293 


49 


401482.2.oct 


4721629H1 


130 


247 


49 


401482.2.oct 


1372579H1 


133 


237 


49 


401482.2.oct 


43471 91 H1 


128 


302 


49 


401482.2.oct 


g2005112 


132 


311 


49 


401482.2.oct 


5978251 H1 


130 


180 


49 


401482.2.oct 


437901 1H1 


132 


301 


49 


401482.2.oct 


4815016H1 


130 


311 


49 


401482.2.oct 


5162976H1 


132 


237 


49 


401482.2.oct 


3430533H1 


129 


337 


49 


401482.2.oct 


2518933H1 


132 


223 


49 


401482.2.oct 


1578109H1 


129 


347 


49 


401482.2.oct 


405083H1 


132 


266 


49 


401482.2.oct 


4580668H1 


129 


274 


49 


401482.2.oct 


2383796H1 


132 


301 


49 


401482.2.oct 


43491 30H1 


129 


330 


49 


401482.2.oct 


4585939H1 


132 


293 


49 


401482.2.oct 


1610424H1 


129 


304 


49 


401482.2.oct 


g 1439429 


132 


249 


49 


401482.2.oct 


3364209H1 


129 


260 


49 


401482.2.OCt 


5556467H1 


135 


300 


49 


401482.2.oct 


6136479H1 


130 


271 


49 


401482.2.oct 


3117637H1 


136 


304 


49 


401482.2.oct 


3398472H1 


130 


239 


49 


401482.2.oct 


3230457H1 


136 


304 


49 


401482.2.oct 


3015102H1 


129 


223 


49 


401482.2.oct 


3153383H1 


135 


311 


49 


401482.2.oct 


5081859H1 


129 


299 


49 


401482.2.oct 


637896H1 


143 


376 


49 


401482.2.oct 


4338871 H1 


129 


284 


50 


274551.1. oct 


g4325750 


1 


103 


49 


401482.2.oct 


458591 1H1 


130 


264 


50 


274551. Loot 


4290049F6 


1 


353 


49 


401482.2.oct 


113024H1 


130 


303 


50 


274551.1 -oct 


4290049H1 


1 


124 


49 


401482.2.oct 


540069H1 


136 


200 


50 


274551.1. oct 


5493752H1 


172 


444 


49 


401482.2x>ct 


6139403H1 


130 


332 


51 


411408.20.dec 


936827H1 


525 


638 


49 


401482.2.oct 


2652958H1 


130 


255 


51 


411408.20.dec 


23751 33H1 


443 


614 


49 


401482.2.oct 


4702083H1 


130 


234 


51 


411408.20.dec 


2300942H1 


492 


621 


49 


401482.2.oct 


3918683H1 


132 


289 


51 


411408.20.dec 


50231 09H1 


442 


622 


49 


401482.2.oct 


4821911H1 


129 


280 


51 


411408.20.dec 


4418917H1 


174 


410 


49 


401482.2.oct 


4501456H1 


129 


323 


51 


411408.20.dec 


3674792H1 


393 


749 


49 


401482.2.oct 


1899608H1 


129 


260 


51 


411408.20.dec 


2805943H1 


393 


756 


49 


401482.2.oct 


1610410H1 


129 


311 


51 


411408.20.dec 


1966865H1 


391 


720 


49 


401482.2.oct 


3891265H1 


144 


251 


51 


411408.20.dec 


4653403H1 


393 


734 


49 


401482.2.oct 


4502830H1 


143 


270 


51 


411408.20.dec 


495621 5H1 


393 


718 


49 


401482.2.OCt 


29241 59H1 


141 


243 


51 


411408.20.dec 


41 09951 H1 


393 


726 


49 


401482.2.oct 


2106528H1 


141 


285 


51 


411408.20.dec 


2812018H1 


393 


719 


49 


401482.2.oct 


1895745H1 


141 


293 


51 


411408.20.dec 


402737H1 


393 


742 


49 


401482.2.oct 


1799893H1 


141 


223 


51 


411408.20.dec 


3770868H1 


394 


765 


49 


401482.2.oct 


3440365H1 


141 


311 


51 


411408.20.dec 


3838768H1 


392 


737 


49 


401482.2.oct 


697099H1 


141 


276 


51 


411408.20.dec 


1403444H1 


393 


738 


49 


401482.2.oct 


5220546H1 


141 


311 


51 


411408.20.dec 


3667386H1 


393 


744 


49 


401482.2.oct 


4640268H1 


131 


304 


51 


411408.20.dec 


3153604H1 


394 


671 


49 


401482.2.oct 


1591126H1 


131 


304 


51 


411408.20.dec 


2992279H1 


392 


746 


49 


401482.2.OCt 


1591132H1 


131 


304 


51 


411408.20.dec 


3746578H1 


393 


587 


49 


401482.2.oct 


g 1270001 


133 


205 


51 


411408.20.dec 


4767753H1 


392 


734 


49 


401482.2.oct 


434681 5H1 


131 


244 


51 


411408.20.dec 


29431 78H1 


393 


764 


49 


401482.2.oct 


3959778H2 


132 


323 


51 


411408.20.dec 


2586769H1 


392 


743 


49 


401482.2.OCt 


117063H1 


133 


244 


51 


411408.20.dec 


3160466H1 


393 


743 


49 


401482.2.oct 


389507 H1 


131 


235 


51 


411408.20.dec 


2832332H1 


393 


701 


49 


401482.2.oct 


2737221 H1 


132 


304 


51 


411408.20.dec 


4912762H1 


402 


740 


49 


401482.2.oct 


5573958H1 


130 


344 


51 


411408.20.dec 


983356H1 


395 


758 


49 


401482.2.oct 


36041 59H1 


130 


280 


51 


411408.20.dec 


5191120H1 


392 


701 


49 


401482.2.oct 


6099064H1 


130 


295 


51 


411408.20.dec 


3637826H1 


394 


742 
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51 


411408.20.dec 


g2252026 


396 


814 


51 


411408.20.dec 


3477204H1 


387 


804 


51 


411408.20.dec 


6132793H1 


392 


724 


51 


411408.20.dec 


g 1357803 


390 


1071 


51 


411408.20 dec 


2907546H1 


395 


782 


51 


411408.20.dec 


4946208H1 


388 


685 


51 


411408.20.dec 


785404H1 


396 


742 


51 


411408.20.dec 


1576142H1 


388 


678 


51 


411408.20.dec 


185321 H1 


397 


613 


51 


411408.20.dec 


g2619527 


389 


792 


51 


411408.20.dec 


3891 85H1 


396 


743 


51 


411408.20.dec 


743255H1 


388 


719 


51 


411408.20.dec 


3534063H1 


396 


799 


51 


411408.20.dec 


4958275H1 


389 


719 


51 


411408.20.dec 


3996765H1 


396 


659 


51 


411408.20.dec 


2552783H1 


389 


692 


51 


411408.20.dec 


3053279H1 


398 


516 


51 


411408.20.dec 


5098761 H1 


389 


710 


51 


411408.20.dec 


1807609H1 


399 


736 


51 


411408.20.dec 


1478849H1 


389 


680 


51 


411408.20.dec 


2523052H1 


399 


737 


51 


411408.20.dec 


2856522H1 


392 


721 


51 


411408.20.dec 


4977531 H1 


399 


735 


51 


411408.20.dec 


5687388H1 


390 


708 


51 


411408.20.dec 


3858932H1 


400 


740 


51 


411408.20.dec 


4523823H1 


390 


716 


51 


411408.20.dec 


1 845501 H1 


398 


658 


51 


411408.20.dec 


3490029H1 


390 


750 


51 


411408.20.dec 


816386H1 


399 


746 


51 


411408.20.dec 


5046950H1 


390 


726 


51 


411408.20.dec 


4198843H1 


399 


743 


51 


411408.20.dec 


4586244H1 


390 


713 


51 


411408.20.dec 


4042101 H1 


398 


724 


51 


411408.20.dec 


2459258H1 


390 


708 


51 


411408.20.dec 


3750944H1 


400 


758 


51 


411408.20.dec 


1 858361 H1 


390 


709 


51 


411408.20-dec 


937287H1 


400 


738 


51 


411408.20.dec 


4416909H1 


389 


717 


51 


411408.20.dec 


3810490H1 


400 


740 


51 


411408.20.dec 


4523905H1 


389 


703 


51 


411408.20.dec 


3114955H1 


401 


756 


51 


411408.20.dec 


276091 8H1 


390 


762 


51 


411408.20.dec 


4566634H1 


401 


753 


51 


411408.20.dec 


4385055H1 


390 


713 


51 


411408.20.dec 


3705807H1 


402 


773 


51 


411408.20.dec 


4853641 H1 


390 


746 


51 


411408.20.dec 


4066846H1 


401 


743 


51 


411408.20.dec 


4819893H1 


388 


701 


51 


411408.20.dec 


371959H1 


403 


562 


51 


411408.20.dec 


3918493H1 


390 


726 


51 


411408.20.dec 


4626333H1 


403 


736 


51 


411408.20.dec 


4367949H1 


391 


703 


51 


411408.20.dec 


4956823H1 


403 


742 


51 


411408.20.dec 


4525882H1 


391 


703 


51 


411408.20.dec 


3670548H1 


406 


742 


51 


411408.20.dec 


4154002H1 


391 


718 


51 


411408.20.dec 


3051835H1 


403 


768 


51 


411408.20.dec 


5671105H1 


392 


590 


51 


411408.20.dec 


029644H1 


405 


745 


51 


411408.20.dec 


5574281 H1 


391 


703 


51 


411408.20.dec 


1535449H1 


404 


654 


51 


411408.20.dec 


4981071 H1 


391 


745 


51 


411408.20.dec 


35921 04H1 


405 


756 


51 


411408.20.dec 


4895035H1 


391 


744 


51 


411408.20.dec 


3993941 H2 


405 


748 


51 


411408.20.dec 


5122182H1 


392 


727 


51 


411408.20.dec 


5028325H1 


406 


750 


51 


411408.20.dec 


4329866H1 


391 


700 


51 


411408.20.dec 


380851 3H1 


419 


811 


51 


411408.20.dec 


5199638H1 


392 


544 


51 


411408.20.dec 


3076490H1 


423 


753 


51 


411408.20.dec 


2548578H1 


391 


687 


51 


411408.20.dec 


5020492H1 


423 


750 


51 


411408.20.dec 


4730925H1 


391 


717 


51 


411408.20.dec 


2318857H1 


423 


764 


51 


411408.20.dec 


349591 1H1 


391 


743 


51 


411408.20.dec 


506742H1 


424 


565 


51 


411408.20.dec 


2116906H1 


391 


713 


51 


411408.20.dec 


3995849H1 


423 


768 


51 


411408.20.dec 


2555366H1 


391 


691 


51 


411408.20.dec 


4126641 H1 


427 


781 


51 


411408.20.dec 


4544977H1 


391 


690 


51 


411408.20.dec 


3930274H1 


430 


700 


51 


411408.20.dec 


3373476H1 


391 


709 


51 


411408.20.dec 


2463303H1 


436 


756 


51 


411408.20.dec 


2966550H1 


392 


682 


51 


411408.20.dec 


3137131 H1 


436 


781 


51 


411408.20.dec 


2763323H1 


391 


692 


51 


411408.20.dec 


767735H1 


448 


735 


51 


411408.20.dec 


4874466H1 


391 


719 


51 


411408.20.dec 


g2056448 


461 


951 


51 


411408.20.dec 


2548853H1 


392 


712 


51 


411408.20.dec 


744424H1 


468 


749 


51 


411408.20.dec 


4841824H1 


391 


746 


51 


411408.20.dec 


508365H1 


470 


746 


51 


411408.20.dec 


4953663H1 


391 


695 


51 


411408.20.dec 


1541976H1 


527 


801 


51 


411408.20.dec 


5115420H1 


392 


735 


51 


411408.20.dec 


24041 34H1 


530 


791 


51 


411408.20.dec 


4845489H1 


391 


633 


51 


411408.20.dec 


1597375T6 


704 


861 


51 


411408.20.dec 


2684639H1 


391 


694 


51 


411408.20.dec 


5700996H1 


385 


736 


51 


411408.20.dec 


6376757H1 


392 


723 


51 


411408.20.dec 


2470451 H1 


385 


716 


51 


411408.20.dec 


2606634H1 


392 


698 


51 


411408.20.dec 


4980959H1 


385 


717 


51 


411408.20.dec 


32981 29H1 


391 


700 


51 


411408.20.dec 


2318026H1 


386 


713 


51 


411408.20.dec 


4767905H1 


390 


736 


51 


411408.20.dec 


4702372H1 


385 


705 


51 


411408.20.dec 


2256874H1 


391 


704 


51 


411408.20.dec 


6562862H1 


392 


830 


51 


411408.20.dec 


4982262H1 


391 


726 


51 


411408.20.dec 


2605436H1 


386 


704 


51 


411408.20.dec 


3662084H1 


391 


742 


51 


411408.20.dec 


3567254H1 


386 


650 


51 


411408.20.dec 


4843594H1 


392 


718 


51 


411408.20.dec 


4976438H1 


386 


712 


51 


411408.20.dec 


3675093H1 


392 


746 


51 


411408.20.dec 


1984353H1 


386 


724 


51 


411408.20.dec 


4385088H1 


391 


712 


51 


411408.20.dec 


5946551 H1 


386 


746 


51 


411408.20.dec 


2548726H1 


392 


709 


51 


411408.20.dec 


764555H1 


387 


703 


51 


411408.20.dec 


3351335H1 


390 


713 


51 


411408.20.dec 


g1 955984 


387 


714 


51 


411408.20.dec 


3442587H1 


392 


565 


51 


411408.20.dec 


73641 3R1 


388 


859 


51 


411408.20.dec 


3995356H2 


392 


745 


51 


411408.20.dec 


4327844H1 


391 


700 


51 


411408.20.dec 


465371 1H1 


392 


745 


51 


411408.20.dec 


3617318H1 


388 


758 


51 


411408.20.dec 


4977378H1 


392 


731 
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51 


411408.20.dec 


3456074H1 


392 


711 


51 


411408.20.dec 


2591832H1 


367 


654 


51 


411408.20.dec 


4765640H1 


392 


710 


51 


411408.20.dec 


2155019H1 


361 


448 


51 


411408,20.dec 


2544077H1 


392 


743 


51 


411408.20.dec 


1709953H1 


373 


663 


51 


411408.20.dec 


51 75221 H1 


392 


546 


51 


411408.20.dec 


4083858H1 


373 


712 


51 


411408.20.dec 


2649975H1 


393 


709 


51 


411408.20.dec 


2554967H1 


374 


663 


51 


411408.20.dec 


780432H1 


392 


526 


51 


411408.20.dec 


399331 4H1 


374 


743 


51 


411408.20.dec 


161 3961 H1 


393 


517 


51 


411408.20.dec 


49751 09H1 


373 


724 


51 


411408 20 dec 

■ ■ ■ * x^ w • xx » *-* xv 


780667H1 


392 

W S~m- 


734 


51 


411408.20.dec 


646957H1 


380 


717 


51 


411408 20 dec 

» ■ ■ ■ X^ W • *— XX . XJ X^ V 


4976638H1 


391 


723 


51 


411408.20.dec 


2150550H1 


383 


736 


51 


41 1408 20 dec 


4955591 H1 


392 


735 


51 


411408.20.dec 


3179540H1 


382 


768 


51 


41 1408 20 dec 


4917657H1 


392 


735 


51 


411408.20.dec 


1970802H1 


381 


731 


51 


41 1408 20 dec 


2211455H1 


393 


714 


51 


411408.20.dec 


3737635H1 


382 


759 


51 


41 1408 20 dec 


4750777H1 

f x^^^ « F fill 


392 


737 


51 


411408.20.dec 


4521821 H1 


382 


703 


51 


41 1408 20 dec 

"Til TUU.L.U.UCVf 


4372262H1 


392 


729 


51 


411408.20.dec 


4556748H1 


382 


700 


51 


41 1408 20 dec 

1 1 » VU • xx * x^xv 


250221 7H1 


392 


698 


51 


411408.20.dec 


1798726H1 


382 


691 


51 


411 408 20 dec 


456491 4H1 


392 


695 


51 


411408.20.dec 


4145903H1 


382 


789 


51 


41 1408 20 dec 


4546001 H1 


392 


754 


51 


411408.20.dec 


4150946H1 


382 


703 


51 


41 1408 20 dpc 

"■Til tUU.CU.UCV/ 


4124159H1 


392 


729 

9 xx 


51 


411408.20.dec 


3118151H1 


383 


749 


51 


41 1408 20 dec 


3976845H1 


392 


613 


51 


411408.20.dec 


1 97341 4H1 


383 


704 


51 


41 1408 20 dec 


4503808H1 


390 

*V V V 


713 


51 


411408.20.dec 


3621844H1 


382 


723 


51 


41 1408 20 dec 


3055191H1 


391 


743 


51 


411408.20.dec 


2156096H1 


387 


686 


51 


41 1 408 20 dec 


4662503H1 


392 


703 


51 


411408.20.dec 


4150328H1 


381 


718 


51 


41 1408 20 dec 

■ ■ ^TxXW a«M%r ♦ X-ffX^XX 


2605825H1 


392 


736 


51 


411408.20.dec 


2806959H1 


382 


661 


51 


411408 20 dec 

■ ■ I ■ Xrf • X^ • X^X-*" X^ 


1005043H1 


392 


720 


51 


411408.20.dec 


505991 4H1 


382 


716 


51 


41 1408 20 dec 


2446394H1 


393 


674 


51 


411408.20.dec 


5053073H1 


384 


603 


51 


411408 20.dec 

• ■ W » X^ X^ » X^ • X^ X^ *^ 


4990021 H1 


392 


726 


51 


411408.20.dec 


3898664H1 


383 


705 


51 


411408.20.dec 


4148690H1 


392 


729 


51 


411408.20.dec 


26681 17H1 


382 


710 


51 


41 1408 .20. dec 


3607084H1 


392 


663 


51 


411408.20.dec 


3023657H1 


382 


724 


51 


411408.20.dec 

» » • * ^^X^ * M« X^ * X* X^ X^ 


50851 83H1 


391 


704 


51 


411408.20.dec 


2551978H1 


383 


712 


51 


41 1408 20 dec 

■ I v ■ xx xx » < ■ x^ * x*« x^ x^ 


2857082H1 


392 


711 


51 


411408.20.dec 


4710332H1 


384 


736 


51 


411408 20 dec 


2632563H1 


392 


734 


51 


411408.20.dec 


4138190H1 


384 


751 


51 


411408.20.dec 


4906727H2 


392 


759 


51 


411408.20.dec 


56841 04H1 


385 


727 


51 


411408.20.dec 


2700833H1 


392 


658 


51 


411408.20.dec 


4797725H1 


386 


714 


51 


411408 20 dec 

■ w m w x^x^ * x^ » x*i x^ x^ 


2535528H1 


392 


657 


51 


411408.20.dec 


5669575H1 


390 


627 


51 


41 1408 20 dec 

» ff ■ r xx vx • *^ v/ * Xj» x* x«* 


3490780H1 


392 


759 


51 


411408.20.dec 


598827H1 


442 


552 


51 


41 1408 20 dec 


3894336H1 


392 


761 


51 


411408.20.dec 


891997H1 


443 


578 


51 


41 1408 20 dec 

T I ■ w X*r X»» • X»» » X^ 


397051 3H1 


392 


758 


51 


411408.20.dec 


753453H1 


491 


588 


51 


41 1408 20 dec 


31 71 791 H1 

IV ■ ff ■ ■ ■ ■ 


392 


618 


51 


411408.20.dec 


2465958H1 


483 


593 


51 


41 1408 20 dec 


1723460H1 


392 


611 


51 


411408.20.dec 


4831218H1 


444 


579 


51 


41 1408 20 dec 


3476891 H1 

V-r ■ ff www till 


392 


807 


51 


411408.20.dec 


598686H1 


442 


601 


S1 


41 1408 20 dec 


4981057H1 

ff V Vw* ■ V#* ff ff I ff 


392 


749 


52 


035973.1. dec 


5401350H1 


1 


105 


S1 


41 1408 20 dec 


4802806H1 


392 


749 


52 


035973.1. dec 


605761 7H1 


56 


643 


51 


41 1408 20 dec 


4110394H1 

■ ■ *^*^x^ rff ■ ff 


392 


588 


52 


035973.1. dec 


g32 14092 


406 


782 


S1 


41 1408 20 dec 


3575464H1 


392 


565 


52 


035973.1. dec 


35241 02H1 


479 


779 


51 


41 1408 20 dec 


1729387H1 


1 


113 


53 


456536.1. dec 


g8 19467 


922 


1236 


51 


41 1408 20 dec 


3000408H1 


5 


303 


53 


456536.1. dec 


4591941 H1 


925 


1184 


51 


411408 20 dec 
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1029 


1223 


53 


456536.1. dec 


4632451 H1 


426 


684 


53 


456536.1. dec 


g21 02970 


1043 


1226 


53 


456536.1. dec 


g990010 


428 


684 


53 


456536.1. dec 


2987426H1 


231 


508 


53 


456536.1. dec 


2395270H1 


438 


665 


53 


456536.1. dec 


3377045H1 


127 


377 


53 


456536.1. dec 


1488271 H1 


479 


743 


53 


456536.1. dec 


1899725H1 


123 


379 


53 


456536.1. dec 


4208981 H1 


404 


537 


53 


456536.1. dec 


1900788H1 


123 


366 


53 


456536.1. dec 


4727980H1 


417 


678 


53 


456536.1. dec 


g 1298038 


126 


378 


53 


456536.1. dec 


2560321 H1 


401 


663 


53 


456536.1. dec 


449807H1 


131 


296 


53 


456536.1. dec 


1 575881 H1 


70 


187 


53 


456536.1. dec 


3460674H1 


108 


331 


53 


456536.1. dec 


3747523H1 


70 


371 


53 


456536.1. dec 


1751340H1 


108 


312 


53 


456536.1. dec 


1450986H1 


69 


322 


53 


456536.1. dec 


2110208H1 


109 


366 


53 


456536.1. dec 


3024440H1 


69 


312 


53 


456536.1. dec 


5865275H1 


121 


406 


53 


456536.1. dec 


2651 171 H1 


73 


208 


53 


456536.1. dec 


g831903 


134 


456 


53 


456536.1. dec 


1225060H1 


57 


292 


53 


456536.1 .dec 


744457H1 


135 


388 


53 


456536.1. dec 


6298584H1 


64 


309 
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53 


456536.1. dec 


540325H1 


67 


275 


55 


406790.3.dec 


5812480H1 


19 


295 


53 


456536.1. dec 


3537964H1 


68 


368 


55 


406790.3.dec 


3458385H1 


19 


249 


53 


456536.1 .dec 


3456808H1 


68 


311 


55 


406790.3.dec 


64331 30H1 


45 


638 


53 


456536.1. dec 


3189412H1 


70 


389 


55 


406790.3.dec 


995381 T6 


207 


791 


53 


456536.1. dec 


g4899087 


812 


949 


55 


406790.3.dec 


5594655H1 


375 


625 


53 


456536.1 .dec 


g3753281 


812 


1221 


55 


406790.3.dec 


g2341498 


402 


767 


53 


456536.1. dec 


g3598298 


812 


1225 


55 


406790.3.dec 


g4664630 


422 


859 


53 


456536.1. dec 


g41 13780 


812 


1221 


55 


406790.3.dec 


g5396183 


431 


859 


53 


456536.1 .dec 


2202050H1 


794 


1047 


55 


406790.3.dec 


g4311961 


435 


856 


53 


456536.1. dec 


1863502T6 


794 


1174 


55 


406790.3.dec 


g3840805 


503 


859 


53 


456536.1. dec 


g3676957 


787 


1221 


56 


412420.63.dec 


g666234 


1 


279 


53 


456536.1. dec 


g2007458 


779 


1090 


56 


412420.63.dec 


g4487110 


1 


296 


53 


456536.1. dec 


g2787281 


782 


949 


56 


412420.63.dec 


g 1977452 


1 


228 


53 


456536.1. dec 


g3649015 


784 


1227 


56 


412420.63.dec 


g2805755 


1 


83 


53 


456536.1. dec 


1923568H1 


761 


1024 


56 


412420.63.dec 


2898360H1 


1 


99 


53 


456536.1. dec 


g3924137 


763 


1222 


57 


196623.3.dec 


2517648H1 


1 


231 


53 


456536.1. dec 


g2835063 


763 


1215 


57 


196623.3.dec 


5686895H1 


1 


273 


53 


456536.1. dec 


2752626H1 


759 


1037 


57 


196623.3.dec 


4645207H1 


3 


207 


53 


456536.1. dec 


g3330927 


776 


1217 


57 


196623.3.dec 


4605887H1 


4 


259 


53 


456536.1. dec 


1687386H1 


833 


1041 


57 


196623.3.dec 


2901373F6 


5 


502 


53 


456536.1 .dec 


g4650333 


834 


949 


57 


196623.3.dec 


2901373H1 


5 


293 


53 


456536.1. dec 


6166167H1 


843 


1222 


57 


196623.3.dec 


1 39401 6F6 


5 


393 


53 


456536.1. dec 


g5392951 


841 


1222 


57 


196623.3.dec 


1600290H1 


7 


202 


53 


456536.1. dec 


1982079T6 


814 


1180 


57 


196623.3.dec 


1 39401 6H1 


5 


261 


53 


456536.1. dec 


1982079H1 


814 


1038 


57 


196623.3.dec 


1395280H1 


5 


254 


53 


456536.1. dec 


g2656831 


813 


1221 


57 


196623.3.dec 


269671 4H1 


6 


236 


53 


456536.1 .dec 


g2753183 


815 


949 


57 


196623.3.dec 


g2032778 


11 


208 


53 


456536.1. dec 


1982079R6 


814 


1164 


57 


196623.3.dec 


3874480H1 


13 


267 


53 


456536.1. dec 


g4096028 


831 


949 


57 


196623.3.dec 


5676590H1 


15 


276 


53 


456536.1. dec 


g3924307 


812 


1214 


57 


196623.3. dec 


5410878H1 


17 


272 


54 


387807 .4.oct 


37021 47H1 


1 


291 


57 


196623.3.dec 


1552649H1 


17 


210 


54 


387807.4.oct 


g3307321 


221 


621 


57 


196623.3.dec 


4823454H1 


18 


179 


54 


387807.4.oct 


g3644993 


223 


518 


57 


196623.3.dec 


4506764H1 


26 


295 


54 


387807.4.oct 


g2836106 


227 


596 


57 


196623.3.dec 


4302206H1 


24 


279 


54 


387807.4.oct 


g3431952 


226 


573 


57 


196623.3.dec 


6515768H1 


30 


490 


54 


387807.4.oct 


g2714088 


228 


633 


57 


196623.3.dec 


2716365H1 


26 


286 


54 


387807 .4.oct 


4560443T6 


333 


631 


57 


196623.3.dec 


586681 8H1 


27 


279 


54 


387807.4.OCt 


5980275H1 


342 


631 


57 


196623.3.dec 


3136703H1 


26 


312 


54 


387807.4.oct 


292878H1 


416 


521 


57 


196623.3.dec 


3873631 H1 


26 


330 


54 


387807.4.oct 


1419003T6 


427 


527 


57 


196623.3.dec 


376501 4H1 


28 


321 


54 


387807.4.oct 


503991 1H1 


498 


740 


57 


196623.3.dec 


33521 94H1 


26 


281 


54 


387807.4.OCt 


4893519H1 


536 


615 


57 


196623.3.dec 


2659790H1 


26 


252 


54 


387807.4.oct 


4532934H1 


638 


889 


57 


196623.3.dec 


4876834H1 


29 


300 


54 


387807.4.oct 


31 88491 H1 


643 


867 


57 


196623.3.dec 


377041 5H1 


29 


341 


54 


387807.4.oct 


31 88491 R6 


643 


986 


57 


196623.3.dec 


4530091 H1 


31 


274 


54 


387807.4.oct 


2274775H1 


643 


798 


57 


196623.3.dec 


1631018H1 


29 


147 


54 


387807.4.oct 


1419003F6 


643 


1075 


57 


196623.3.dec 


54401 30H1 


32 


245 


54 


387807.4.oct 


3491191H1 


643 


732 


57 


196623.3.dec 


3893567H1 


31 


199 


54 


387807 .4.cct 


63221 66H1 


643 


827 


57 


196623.3.dec 


3633822H1 


33 


314 


54 


387807 .4,oct 


g1 984547 


654 


954 


57 


196623.3.dec 


2210577H1 


32 


291 


54 


387807.4.oct 


5272773H1 


683 


764 


57 


196623.3.dec 


4668793H1 


34 


297 


54 


387807.4.oct 


5098260H1 


697 


962 


57 


196623.3.dec 


3456796H1 


33 


274 


54 


387807 .4,oct 


4245582H1 


784 


1019 


57 


196623.3.dec 


223764R1 


33 


631 


54 


387807.4.oct 


440341 4H1 


790 


922 


57 


196623.3.dec 


3833345H1 


33 


300 


54 


387807.4.oct 


1419059H1 


851 


1075 


57 


196623.3.dec 


63771 78H1 


33 


279 


54 


387807 .4.oct 


1419003H1 


861 


1075 


57 


196623.3.dec 


223764H1 


33 


256 


54 


387807 .4.oct 


6102706H1 


901 


1194 


57 


196623.3.dec 


225430H1 


33 


248 


54 


387807.4.oct 


g2629621 


914 


1312 


57 


196623.3.dec 


067067H1 


33 


178 


54 


387807 .4.oct 


2490348H1 


1115 


1342 


57 


196623.3.dec 


g1 72881 3 


36 


284 


54 


387807.4.oct 


3617390H1 


1143 


1339 


57 


196623.3.dec 


24031 39H1 


33 


271 


55 


406790.3.dec 


3661 11H1 


1 


236 


57 


196623.3.dec 


801372H1 


38 


259 


55 


406790.3.dec 


g2055030 


2 


422 


57 


196623.3.dec 


2411122H1 


38 


254 


55 


406790.3.dec 


2817564H1 


16 


272 


57 


196623.3.dec 


3596160H1 


38 


312 


55 


406790.3.dec 


3449061 H1 


17 


259 


57 


196623.3.dec 


067065H1 


41 


194 


55 


406790.3.dec 


995381 R6 


18 


553 


57 


196623.3.dec 


3584836H1 


35 


312 


55 


406790.3.dec 


995381 H1 


18 


326 


57 


196623.3.dec 


167565H1 


56 


371 


55 


406790.3.dec 


6433084H1 


45 


462 


57 


196623.3.dec 


3859670H1 


58 


338 
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57 


196623.3.dec 


g2032705 


59 


366 


59 


264633.8dec 


3580034F6 


2957 


3319 


57 


1 96623.3.dec 


g4737259 


137 


499 


59 


264633.8.dec 


3580034H1 


2959 


3246 


57 


196623.3.dec 


4092207H1 


226 


474 


59 


264633.8.dec 


52731 11H1 


2974 


3231 


57 


196623.3. dec 


4581304H1 


236 


481 


59 


264633.8.dec 


906926H1 


3013 


3173 


57 


1 96623.3.dec 

1 X^ XjV XV*!"**** • X^ # X^ 


8981 09 R6 


241 


647 


59 


264633.8.dec 


1439286H1 


3061 


3307 


57 


196623 3 dec 


8981 09H1 


241 


492 


59 


264633.8.dec 


1 69981 R1 


3092 


3365 


57 


196623.3.dec 


030943H1 

X** X^ SmW W Xm* 1 1 1 


568 


844 


59 


264633.8.dec 


169981 F1 


3092 


3366 


57 

w * 


1 96623 3 dec 


2006252H1 


568 


727 


59 


264633.8.dec 


169981 H1 


3092 


3320 


57 


1 96623 3 dec 


3901332H1 


581 


810 


59 


264633.8.dec 


6493338H1 


3130 


3608 


57 


1 96623 3 dec 

■ x«* ^v w^-> vv • XV ■ \m*^»r \+ 


2007562 H1 


655 


838 


59 


264633.8.dec 


1320776H1 


3138 


3388 


57 


1 96623 3 dec 


38581 13H1 


759 


1038 


59 


264633.8.dec 


492151 H1 


3146 


3409 


57 

w 9 


1 96623 3 dec 


4422306 H1 


759 


1021 


59 


264633.8.dec 


3460677H1 


3158 


3381 


58 

w V 


427916 8 dec 


1413110H1 

■ r 1 x^ ■ t x^ ■ * » 


1 


161 


59 


264633.8.dec 


4741080H1 


3163 


3370 


58 


427916 8 dec 


14131 10F6 

t • 1 X-* 1 ■ XV I w 


1 


457 


59 


264633.8.dec 


5616847H1 


3175 


3284 


58 


427916 8 dec 


2483841 H1 

■ W x^* x^ x ■ ■ • » 


1 


196 


59 


264633.8.dec 


6306822H1 


3225 


3669 


58 


427916 8 dec 


1415706H1 

■ I X/ / vWl ■ ■ 


1 


182 


59 


264633.8.dec 


5313464H1 


3233 


3301 


58 


427916 8 dec 


a389421 

Vj WWW ■ 


1 


251 


59 


264633.8.dec 


1558226F6 


3236 


3491 


58 


427916 8 dec 


4434759H1 


26 


282 


59 


264633.8.dec 


345031 2H1 


3286 


3532 


58 


427916 8 dec 


5529558H1 

WWta W WWW 1 ■ ■ 


32 


309 


59 


264633.8.dec 


1 669451 H1 


3295 


3526 


58 


427916 8 dec 


3360746H1 

WW WW ■ W^mTW ¥ W 


34 


291 


59 


264633.8.dec 


6546636H1 


3352 


3785 


58 


427916 8 dec 

W ■» i r x^ * xv » x^ » x«* x^ x** 


3039830H1 


87 


370 


59 


264633.8.dec 


275341 0T6 


3317 


3910 


58 


427916 8 dec 


53571 36H1 


138 


317 


59 


264633.8.dec 


5906384H1 


3323 


3596 


58 


427916 8 dec 

W hmmm ' X^ 1 X** » XjV * X^ XV X^ 


5310884H1 


161 


386 


59 


264633.8.dec 


g1 7361 30 


3367 


3711 


58 


42791 6.8.dec 


2227664H1 


163 


341 


59 


264633.8-dec 


g1689800 


3490 


3578 


58 


42791 6.8.dec 


4289094H1 


212 


340 


59 


264633.8.dec 


3639891 H1 


3518 


3788 


59 


264633 8 dec 


2760074H1 


2557 


2829 


59 


264633.8.dec 


4182954H1 


3564 


3754 


59 


264633.8.dec 


5477979H1 


2625 


2764 


59 


264633.8.dec 


5615577H1 


3592 


3879 


59 


264633 8.dec 

X^ ■ XV XV XV v xv • M XV XV 


g3041476 


4970 


5278 


59 


264633.8.dec 


4958507H1 


3619 


3783 


59 


264633.8.dec 


g2881327 


5011 


5285 


59 


264633.8-dec 


g1 558771 


3639 


4036 


59 


264633.8-dec 


4005052H2 


3711 


3945 


59 


264633.8.dec 


50841 30H1 


3643 


3871 


59 

WW 


264633 8.dec 

^_^v rvw .xv^x^xv^v 


g51 23788 


1 


7677 


59 


264633.8.dec 


1645888F6 


3673 


4069 


59 


264633.8. dec 

- XV ■ XVXVXV m xv » X^ xv X^ 


293961 3H1 


1 


92 


59 


264633.8-dec 


g5452540 


4101 


4355 


59 


264633.8.dec 


g3839173 


7222 


7683 


59 


264633-8.dec 


g4987546 


5067 


5285 


59 

w w 


264633.8-dec 


g3959822 


7238 


7663 


59 


264633.8.dec 


g1 7341 59 


5073 


5285 


59 


264633 8 dec 

Am. W~ V W 9 XV * Xp# W V«* 


q4332118 

XJ W X^ Wa* » ■ X^ 


7244 


7702 


59 


264633.8.dec 


1499149H1 


5076 


5285 


59 


264633 8 dec 

fcW~www » xV »x*l WxV 


2203534H1 


7461 


7671 


59 


264633.8.dec 


6111175H1 


5078 


5329 


59 


264633 8 dec 

£_ \v*~ W w W - Vv* w w w 


275341 0R6 

*— » ww r i x^ ■ «x^ 


2446 


3017 


59 


264633.8.dec 


221 401 OH 1 


5097 


5342 




264633 8 dec 


275341 0H1 

^ § WW~ ■ x/ ■ ■ ■ 


2446 


2701 


59 


264633.8-dec 


2119572H1 


5046 


5289 


59 

ww 


264633 8 dec 

*mm W » XV WXV » xv * x-» xv x«* 


3669959H1 

WW WW ■ ■ ■ 


6210 


6495 


59 


264633.8.dec 


3178947H1 


5047 


5285 


5Q 


264633 8 dec 


a2878174 


4505 


4894 


59 


264633-8.dec 


g4738584 


5049 


5285 


59 


264633.8.dec 

b XV » W XV XV • XV * X^ X^ X^ 


6245402H1 


4477 


5044 


59 


264633.8.dec 


4888703H1 


5064 


5341 


59 


264633 8 dec 


027321 H1 


4504 


4746 


59 


264633.8.dec 


g2241797 


7304 


7680 




264633 8 dec 


□2240505 


4505 


4917 


59 


264633.8.dec 


g3843660 


7321 


7697 


59 


264633 8 dec 

W W 7 « W wW • KV ■ XV x^ V 


4884709 H1 

vw ■ ■ W X** ■ ■ ■ 


4860 


5051 


59 


264633.8-dec 


6386767H1 


5770 


6050 


59 


264633 8 dec 


g1 7361 31 


4841 


5287 


59 


264633.8.dec 


4547233H1 


5699 


5962 


59 

WW 


264633 8 dec 

w rv w*w ■ xv x-* xv 


2053603H1 


4841 


5092 


59 


264633.8.dec 


5193918F6 


5913 


6294 


59 


264633 8 dec 

W rWVWtV • VjIw xv 


4825493H1 

~ WWW i w* ■ ■ 


4851 


5103 


59 


264633.8.dec 


1903050H1 


5043 


5287 


59 


264633.8 dec 

■ ■ X^ ? x^X^X^ ■ X^ » X^ x-r x^ 


2047635H1 


4815 


5046 


59 


264633.8-dec 


4706229H1 


2406 


2663 


59 


264633.8.dec 

1 ■ X*^ r Vr W * Xj»t ♦ x«" 


g2566999 


4825 


5246 


59 


264633.8.dec 


5193918H1 


5913 


6179 


59 


264633.8.dec 


2688383H1 


4752 


5008 


59 


264633.8.dec 


645727H1 


5105 


5362 


59 


264633.8.dec 

X»T 1 x*x«x»»x*»**ww 


g2705809 


4781 


5279 


59 


264633.8.dec 


861167H1 


5222 


5452 


59 


264633.8.dec 


625590H1 


4540 


4777 


59 


264633-8.dec 


2834470H1 


5222 


5491 


59 


264633.8.dec 


3375634H1 


4577 


4814 


59 


264633.8.dec 


2263921 H1 


5231 


5460 


59 


264633.8-dec 


298621 2H1 


4539 


4795 


59 


264633.8.dec 


1669519H1 


5280 


5508 


59 


264633.8.dec 


1891014H1 


4653 


4935 


59 


264633.8.dec 


g3756312 


4864 


5285 


59 


264633 8 dec 


3622936H1 


4668 


4948 


59 


264633.8.dec 


g3734545 


4143 


4355 


59 


264633.8.dec 


5838668H1 


7176 


7397 


59 


264633.8.dec 


g2816934 


4309 


4376 


59 


264633.8.dec 


2964536H1 


7182 


7478 


59 


264633.8.dec 


g1 558712 


4905 


5285 


59 


264633.8.dec 


5889738H1 


6260 


6544 


59 


264633.8.dec 


g4312948 


4903 


5217 


59 


264633.8.dec 


5882902H1 


6260 


6547 


59 


264633.8-dec 


g3896413 


4922 


5285 


59 


264633.8.dec 


2936459H1 


7383 


7655 


59 


264633.8.dec 


g3432963 


4928 


5285 


59 


264633.8.dec 


g1 01 0200 


7376 


7694 


59 


264633.8-dec 


g3753704 


4930 


5285 


59 


264633.8.dec 


g1 01 1545 


7394 


7674 


59 


264633.8-dec 


g3040420 


3945 


4355 


59 


264633.8.dec 


g 1026402 


7401 


7690 


59 


264633.8.dec 


g3917139 


3960 


4355 


59 


264633.8.dec 


g2902950 


4086 


4337 


59 


264633.8.dec 


g3835247 


3937 


4355 


59 


264633.8.dec 


2055804H1 


4072 


4329 


59 


264633.8.dec 


g4896231 


3944 


4355 


59 


264633.8.dec 


g2788822 


4078 


4376 


59 


264633.8.dec 


g1 958739 


7059 


7443 
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59 


264633.8.dec 


5153385H1 


7130 


7393 


59 


264633.8.dec 


4004375H1 


3853 


4015 


59 


264633.8.dec 


5735626H1 


7042 


7292 


59 


264633.8.dec 


3314920H1 


3729 


3960 


59 


264633.8.dec 


g2360618 


7048 


7445 


59 


264633.8. dec 


g2567000 


4974 


5241 


59 


264633.8.dec 


5620491 R8 

*^*#*a__*# T w %pr ■ i 


4132 


4404 


59 


264633.8.dec 


1903050T6 


5000 


5251 


59 


264633.8.dec 


g1 71 3052 


4134 


4351 


59 


264633.8.dec 


g5236647 


3912 


4366 


59 


264633.8.dec 


1645888H1 


3673 


3833 


59 


264633.8.dec 


g2908547 


3915 


4337 


59 


264633.8-dec 


q3805397 

W W W WW * 


3679 


3967 


59 


264633.8.dec 


2831092H1 


3928 


4003 


59 


264633.8.dec 


009055R6 


7360 


7680 


59 


264633.8.dec 


5882023H1 


6260 


6521 


59 


264633 8 dec 


009055H1 

WWWWWKi'l 1 ■ 


7360 


7608 


59 


264633.8.dec 


5887277H1 


6260 


6462 


59 

w w 


264633 8 dec 


n4453396 

T T W W W W W 


7361 


7685 


59 


264633.8.dec 


3931164H1 


6224 


6500 


59 


264633 8 dec 


01379240 


7372 


7680 


59 


264633.8.dec 


5545540H1 


2633 


2828 


59 


264633 8 dec 

JU- wT^ W w » w * V-J w w 


6321696H1 

w I v/w wl 1 ■ 


1962 


2237 


59 


264633.8.dec 


5546368H1 


2643 


2831 


59 

w w 


264633 8 dec 

*— - V » VWW ■ w *w w w 


3339964H1 

V^WWWWX*^TI ■ ' 


2174 


2441 


59 


264633.8.dec 


g4522792 


7268 


7680 


59 

W W 


264633 8 dec 

W~ W W W . W *Uv7Lf 


4910157H1 

^Tw 1 \J 1 w /III 


1847 


2112 


59 


264633.8.dec 


3839760H1 


7275 


7566 




264633 8 dec 


a1 977752 


1628 

1 W i— ■ W 


1793 


59 


264633.8.dec 


g 1623287 


7296 


7693 


59 


264633 8 dec 


05689516 

y v ww\/ t w 


1736 


4405 


59 


264633.8.dec 


g752466 


7403 


7674 


59 


264633 8 dec 


a4076602 


7402 


7703 


59 


264633.8.dec 


g991482 


7401 


7663 


59 

ww 


264633 8 dec 

t_ W*"T W WW - W * VI WW 


a5707037 

VJ w / \J i WW * 


7552 


7689 


59 


264633.8.dec 


g2877698 


3985 


4337 


59 

w*7 


264633 8 dec 


21 49426 H1 


7582 


7674 


59 


264633.8.dec 


g2876827 


4035 


4228 


59 
ww 


264633 8 dec 

■Ww^^Vif %*/w- w- ww w 


1669010H1 


5280 


5510 


59 


264633.8.dec 


2055804R6 


4072 


4354 


59 

v W 


264633 8 dec 


02008277 


5382 


5615 


59 


264633.8.dec 


g5529021 


3985 


4355 


59 

ww 


264633 8 dec 


4178644H1 


5659 


5912 


59 


264633.8.dec 


401718H1 


7001 


7266 


59 

WW 


264633 8 dec 


6181567H1 


5672 


5898 


59 


264633.8.dec 


g5591949 


7006 


7283 


59 

WW 


264633 8. dec 


6209982H1 


7225 


7524 


59 


264633.8.dec 


g2328920 


7012 


7445 


59 


264633 8 dec 


5020453T1 


7252 


7655 


59 


264633.8.dec 


2649891 H1 


7029 


7282 


59 


264633.8. dec 


g3246288 


7247 


7684 


59 


264633.8.dec 


g1 623286 


6986 


7305 


59 


264633.8.dec 


g4982958 


7248 


7639 


59 


264633.8.dec 


g2902964 


6987 


7454 


59 


264633 .8.dec 


5193918T6 


7257 


7678 


59 


264633.8.dec 


5020453H1 


6854 


6978 


59 


264633. 8. dec 


g2957486 


7258 


7445 


59 


264633.8.dec 


5093687H1 


6891 


7158 


59 

WW 


264633 8 dec 


055084H1 


7269 


7435 


59 


264633.8.dec 


g1 026722 


6895 


7268 


59 

WW 


264633 8 dec 


067973H1 


7272 


7458 


59 


264633.8.dec 


g1 02501 7 


6895 


7114 


59 


264633.8.dec 


3867461 H1 


6384 


6511 


59 


264633.8.dec 


5094369H1 


6929 


7172 


59 

WW 


264633 8.dec 


5807225H1 


6260 


6491 


59 


264633.8.dec 


26531 54H1 


6938 


7179 


59 


264633 8 dec 


a2589262 

WbhWWWCbWCm 


4437 


4915 


59 


264633.8.dec 


3172174H1 


6957 


7172 


59 


264633 8 dec 


a4901330 


4427 


4874 


59 


264633.8.dec 


6209435H1 


6983 


7251 




264633 8 dec 


a1 9281 77 

*4 * W*v_*-* • 9 W 


4469 


4715 


59 


264633.8.dec 


2518677H1 


6984 


7221 


59 


264633 8 dec 


a 1524709 


4876 


5295 


59 


264633.8.dec 


2896833H1 


6696 


6972 


59 

WW 


264633 8 dec 

I WWW B *J « "hat 


1903050F6 


4895 


5287 


59 


264633.8.dec 


g752465 


6709 


6988 


59 

WW 


264633 8 dec 

4W» W^^V*» WW » W B W V* 


5881 559 H1 

Vb*Vp#*«^ ■ WWW 111 


6260 


6525 


59 


264633.8.dec 


3403481 H1 


6715 


6956 


S9 

w W 


264633 8 dec 


5883369 H1 

WW WWW ■ ■ ■ 


6260 


6522 


59 


264633.8.dec 


5040507H1 


6715 


6952 


59 


264633 8 dec 


2412358H1 

W. f 1 C_WV/ Wl I ■ 


6332 


6579 


59 


264633.8.dec 


5041590H1 


6716 


6961 


59 


264633 8 dec 


5544024H1 


6356 


6554 


59 


264633.8.dec 


3988731 H1 


6719 


7004 


59 

WW 


264633 8 dec 

4Uv~VVAf .\«#.w w w 


4554593H1 


6374 


6601 


59 


264633.8.dec 


3645638H1 


6726 


7013 




264633 8 dec 


260539H1 

■WWWWW 1 ■ • 


6383 


6707 


59 


264633.8.dec 


1891273H1 


6729 


6974 


59 

WW 


264633 8 dec 


5566205H1 

WW W W""W*rf • ■ ■ 


2764 


3004 


59 


264633.8.dec 


946530H1 


6745 


6979 


59 


264633.8.dec 


6357475H1 


3978 


4212 


59 


264633.8.dec 


g1011156 


6786 


7036 


59 


264633 8 dec 


6327349H1 


3978 


4221 


59 


264633.8.dec 


6593361 H1 


6814 


7223 


59 


264633.8.dec 


g775274 


5955 


6067 


59 


264633.8.dec 


g4243143 


6816 


7273 


59 


264633.8.dec 


a3249712 

M w*» ■ W' * ■ •*>> 


273 


7688 


59 


264633.8.dec 


4212002H1 


6845 


7124 


59 

WW 


264633 8 dec 

4b_**r i WWW *m" 


Q319009 

y wwww 


1369 


1645 


59 


264633.8.dec 


1489550F6 


6537 


7023 


59 


264633.8.dec 


g1 921 891 


1499 


1947 


59 


264633.8.dec 


1489550H1 


6537 


6790 


59 


264633 8 dec 


g359561 9 


3924 


4355 


59 


264633.8.dec 


4501054H1 


6558 


6794 


59 


264633.8.dec 


g4888293 


3925 


4392 


59 


264633.8.dec 


30441 25H1 


6579 


6853 


59 


264633.8.dec 


009055T6 


7360 


7664 


59 


264633.8.dec 


5452322H1 


6584 


6835 


59 


264633 8 dec 


g4070420 


4406 


4896 


59 


264633.8.dec 


g1 01 0248 


6586 


6884 


59 

WW 


264633 8 dec 

■■W ■ www«v iWW 


a3803677 


4406 


4802 


59 


264633.8.dec 


g1010260 


6587 


6825 


59 


264633.8.dec 


g4895931 


4424 


4889 


59 


264633.8.dec 


1470385F6 


6641 


7087 


59 


264633.8.dec 


g5595323 


3970 


4119 


59 


264633.8.dec 


1470385H1 


6641 


6830 


59 


264633.8.dec 


287251 5H1 


4676 


4866 


59 


264633.8.dec 


3606436H1 


6642 


6866 


59 


264633.8.dec 


1740929H1 


4711 


4919 


59 


264633.8.dec 


5025468H1 


6675 


6967 


59 


264633.8.dec 


g 1524770 


4708 


5139 


59 


264633.8.dec 


5150604H1 


6679 


6934 


59 


264633.8.dec 


20231 82H1 


4743 


4996 


59 


264633.8.dec 


1456829H1 


6687 


6916 


59 


264633.8.dec 


g 1734256 


3862 


3955 


59 


264633.8.dec 


5913938H1 


6529 


6783 


59 


264633.8.dec 


g2934347 


3910 


4337 


59 


264633.8.dec 


g2254985 


7461 


7682 


59 


264633.8.dec 


g2433925 


3739 


3962 


59 


264633.8.dec 


g4378013 


33 


7691 


59 


264633.8.dec 


6544483H1 


3772 


4274 


59 


264633.8.dec 


g 1976686 


74 


399 


59 


264633.8.dec 


g4395405 


3818 


3955 


59 


264633.8.dec 


335861 2H1 


6392 


6669 
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59 


264633. 8. dec 


g990828 


6479 


6830 


61 


902943.1 .dec 


6490801 H1 


2516 


3033 


59 


264633.8.dec 


4522981 H1 


4965 


5221 


61 


902943.1 .dec 


6489463H1 


2547 


3084 


59 


264633.8.dec 


g2255600 


7326 


7680 


61 


902943.1 .dec 


g2012148 


2641 


2972 


59 


264633.8.dec 


g5446131 


7336 


7683 


61 


902943.1. dec 


6486889H1 


2688 


3182 


59 


264633.8.dec 


g4222673 


7348 


7681 


61 


902943.1. dec 


6482904H1 


2760 


3106 


59 


264633.8.dec 


g3043107 


3694 


3955 


61 


902943.1 .dec 


587321 3H1 


2866 


3167 


59 


264633.8.dec 


5731062H1 


6047 


6250 


61 


902943.1. dec 


5873221 H1 


2870 


3167 


59 


264633.8.dec 


3584229H1 

xar x.*" x>* w m a ■■» XX ■ ■ 9 


6063 


6331 


61 


902943.1 .dec 


643261 5H1 


3109 


3555 


59 


264633.8.dec 


g2014008 


6133 


6423 


61 


902943.1 .dec 


6354243H1 


3330 


3608 


59 


264633.8.dec 


626375H1 

X**" mm* X^ XX * VI ■ » 


6029 


6230 


61 


902943.1 .dec 


5637909H1 


3334 


3554 


59 


264633.8.dec 


62651 9H1 


6029 


6136 


61 


902943.1 .dec 


5644969 H1 


3344 


3574 


59 


264633.8.dec 

X»» W XX XXXX * XX • XM XX XX 


1489550T6 

■ rv/wwv p xx 


7421 


7653 


61 


902943.1. dec 


4457090H1 


3364 


3563 


59 


264633 8 dec 

Wm XjX F XX X^ X F • X F a «— * X^ 


5478430H1 

XX • 9 X^ T XX XXI * P 


5957 


6209 


61 


902943.1. dec 


564251 4H1 


3386 


3612 


59 


264633 8.dec 

■■■■•■XiX ■ xxxxxx • xx • x»*xa»* xx 


g900129 


5978 


6314 


61 


902943.1. dec 


6170624H1 


3511 


3821 


59 


264633.8. dec 

CpvxX » V Wt XjX * X^ X»* XX 


0908380 

V-B Xp* XX XXXX xX x^ 


6015 


6283 


61 


902943.1. dec 


5092668F6 


3637 


4104 


59 


264633 8 dec 


5889703 H1 


6260 


6456 


61 


902943.1. dec 


5092668H1 


3637 


3784 


59 


264633.8.dec 

• XX ■ XX X* Xj^ » xx • x« xx xx 


5882091 H1 


6260 


6342 


61 


902943.1. dec 


4455573H1 


3665 


3908 


59 


264633 8 dec 

m. ■ XX XX XX » XX • Xr4 XX XX 


58841 67H1 

XXXXXX • W XX • ■ P ■ 


6261 


6515 


61 


902943.1. dec 


241 5341 H1 


3936 


4126 


59 


264633.8.dec 


g3797785 


4931 


5290 


61 


902943.1. dec 


241 5341 F6 


3936 


4297 


59 


264633 8 dec 

«■■» XX ■ ^.XXb^XjX • X^ m XX XX XX 


a1 68971 8 

XJ ■ X^ X^ XX 4 ■ x«x 


4957 


5286 


61 


902943.1. dec 


5644569R8 


4000 


4363 


59 


264633.8.dec 

^■■r xx » xxx^xx » xx • xxj xx XX 


g3015912 


4959 


5290 


61 


902943.1 .dec 


563781 1H1 


3999 


4232 


59 


264633.8.dec 


60081 18H1 


7417 


7680 


61 


902943.1 .dec 


5092668R6 


4232 


4744 


59 


264633.8.dec 


2556025H1 


7412 


7642 


61 


902943.1. dec 


4455573R8 


4303 


4499 


60 


337822.4.dec 


g3755755 


581 


888 


62 


256009.2.dec 


368631 3H1 


6803 


7062 


60 


337822.4.dec 


g667395 


768 


882 


62 


256009 .2.dec 


2895979H1 


6802 


7098 


60 


337822.4.dec 


2833833H1 


823 


1081 


62 


256009.2.dec 


2129702H1 


6804 


7076 


60 


337822.4.dec 


g395543 


941 


1212 


62 


256009.2.dec 


1655677H1 


6803 


7029 


60 


337822.4.dec 


g2825241 


1067 


1260 


62 


256009 .2.dec 


g2021489 


6804 


7082 


60 


337822. 4.dec 


g2064264 


1102 


1430 


62 


256009.2.dec 


2884943H1 


6804 


7059 


60 


337822.4.dec 


2915382H1 


7 


140 


62 


256009.2.dec 


3637254H1 


6804 


7017 


60 


337822.4.dec 


g 1269991 


26 


194 


62 


256009.2.dec 


854050H1 


6804 


7046 


60 


337822.4.dec 


g875782 


172 


415 


62 


256009.2.dec 


3243241 H1 


6803 


7045 


60 


337822.4.dec 


g3283993 


210 


1576 


62 


256009 .2.dec 


3693242H1 


6804 


7085 


60 


337822.4.dec 


g51 10760 


423 


884 


62 


256009 ,2.dec 


609587H1 


6804 


7050 


60 


337822 4.dec 

XX XX • X«T 1mm «m « ■ • X«P Xb" 1 X«f 


g3753613 


474 


883 


62 


256009.2.dec 


g4664567 


6805 


7221 


60 


337822.4.dec 


474051 5H2 


558 


827 


62 


256009.2.dec 


2172480H1 


6807 


7062 


60 


337822.4.dec 


4738945H1 


558 


705 


62 


256009.2.dec 


2228032H1 


6804 


7033 


60 


337822 4 dec 


2080490H1 


1140 


1413 


62 


256009.2.dec 


5915452H1 


6804 


7079 


60 


337822 4 dec 

XmTKJ i xX«V«b.4»»» • T * X^t XX XX 


3280776H1 


1 


181 


62 


256009.2.dec 


1373349H1 


6804 


7037 


60 


337822 4 dec 


2915382F6 


1 


318 


62 


256009.2.dec 


3433122H1 


6803 


7045 


60 


337822. 4.dec 

%X^X^ 9 ^mPmmmUmm* V pXjXXXXX 


325291 1H1 


5 


250 


62 


256009 .2.dec 


5293637H2 


6808 


7047 


61 


902943 1 dec 

X^X^&mXV rV* ■ flXXXXXX 


5638959H1 


1 


209 


62 


256009.2.dec 


2272963H1 


6806 


7072 


61 


902943 1 dec 


6491718H1 

w ■ XX * * ■ ^X P ■ ■ 


157 


634 


62 


256009 .2.dec 


1687974H1 


6806 


7053 


61 


902943.1. dec 

X^ X^ ■ X^ « XjX * ■ a XX X^ XX 


2415341T6 


518 


1042 


62 


256009.2.dec 


2317928H1 


6806 


7019 


61 


902943.1. dec 


6428735H1 


867 


1184 


62 


256009.2.dec 


854024H1 


6804 


6999 


61 


902943 1 dec 

v Vx-v ■ xx *■ ■ • x«p x*#" xbf 


g2834117 


922 


1236 


62 


256009 .2.dec 


1616977H1 


6805 


7024 


61 


902943.1. dec 


6426168H1 


943 


1312 


62 


256009.2.dec 


726262H1 


6805 


7031 


61 


902943.1 .dec 


5639272H1 


1027 


1265 


62 


256009.2.dec 


32541 75H1 


6805 


7027 


61 


902943.1. dec 


564251 4R8 


1089 


1441 


62 


256009.2.dec 


4950837H1 


6806 


7083 


61 


902943.1 .dec 


6492034H1 


1167 


1686 


62 


256009 .2.dec 


3098257H1 


6805 


7101 


61 


902943.1. dec 


358895H1 


1444 


1689 


62 


256009 .2.dec 


3015006H1 


6806 


7089 


61 


902943.1. dec 


6487975H1 


1578 


2076 


62 


256009 .2.dec 


1544438R6 


6806 


7187 


61 


902943.1. dec 


5091951 H1 


1586 


1649 


62 


256009.2.dec 


1227986H1 


6806 


7043 


61 


902943.1. dec 


6264350H1 


1694 


1873 


62 


256009.2.dec 


272961 7H1 


6806 


7063 


61 


902943.1. dec 


4455276F6 


1739 


2225 


62 


256009.2.dec 


1385926H1 


6807 


7020 


61 


902943.1. dec 


4455276H1 


1739 


1979 


62 


256009.2.dec 


2079647H1 


6806 


7056 


61 


902943.1. dec 


508981 0H1 


1977 


2265 


62 


256009.2.dec 


6543764H1 


6809 


7200 


61 


902943.1. dec 


6344967H1 


2005 


2274 


62 


256009.2.dec 


g1 954438 


6809 


7194 


61 


902943.1 .dec 


6171524H1 


2010 


2321 


62 


256009.2.dec 


1293789H1 


6808 


7027 


61 


902943.1. dec 


6005774H1 


2069 


2358 


62 


256009.2.dec 


1417792H1 


6808 


7022 


61 


902943.1. dec 


3426344F6 


2093 


2452 


62 


256009.2.dec 


2741429H1 


6808 


7060 


61 


902943.1. dec 


3426344H1 


2095 


2334 


62 


256009.2-dec 


3391859H1 


6806 


7099 


61 


902943.1. dec 


5642539H1 


2204 


2370 


62 


256009.2.dec 


4083084H1 


6806 


7065 


61 


902943.1. dec 


6427209H1 


2208 


2810 


62 


256009.2.dec 


75891 3H1 


6809 


7082 


61 


902943.1 .dec 


5627532H1 


2226 


2466 


62 


256009.2.dec 


2730231 H1 


6806 


7041 


61 


902943.1. dec 


g2243717 


2295 


2559 


62 


256009.2.dec 


605554H1 


6808 


7050 


61 


902943.1. dec 


4284624H1 


2302 


2557 


62 


256009.2.dec 


9241 08H1 


6809 


7019 
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62 


256009. 2. dec 


599887H1 


6806 


7048 


62 


256009.2.dec 


2079972H1 


6809 


7072 


62 


256009.2.dec 


1293789F1 


6808 


7215 


62 


256009. 2.dec 


3168974H1 


6809 


7005 


62 


256009.2xlec 


226661 5H1 


6808 


7046 


62 


256009.2.dec 


6541086H1 


6809 


7215 


62 


256009.2.dec 


1581702H1 


6809 


7005 


62 


256009.2.dec 


1915024H1 


6809 


7052 


62 


256009.2.dec 


22423 19H1 


6808 


7057 


62 


256009.2.dec 


3010414H1 
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6813 


7075 


62 


256009.2.dec 

* X^ X^ X# X^ X^ » fc» • X^ X^ X* 


3943490H1 


6810 


7074 


62 


256009.2.dec 


991986H1 


6813 


7101 


62 


256009.2.dec 


1316820H1 


6809 


7037 


62 


256009 .2 .dec 


1416160H1 


6813 


7052 


62 


256009 .2.dec 


1316792H1 


6809 


7053 


62 


256009.2.dec 


207971 6H1 


6813 


6972 


62 


256009 ,2.dec 


3406855H1 


6810 


7071 


62 


256009.2.dec 


g4890156 


6813 


7219 


62 


256009.2.dec 


819837H1 


6810 


7047 


62 


256009. 2. dec 


5297941 H1 


6814 


6930 


62 


256009.2.dec 


862304H1 


6809 


7061 


62 


256009.2.dec 


3220925H1 


6813 


7119 


62 


256009.2.dec 


2276745H1 


6809 


7082 


62 


256009.2.dec 


g 1270234 


6813 


7137 


62 


256009.2.dec 


990385H1 


6809 


7130 


62 


256009.2.dec 


2346929H1 


6813 


7016 


62 


256009.2.dec 


2121103H1 


6811 


7064 


62 


256009.2.dec 


5391711H1 


6813 


7101 


62 


256009.2.dec 


3049389H1 


6809 


7042 


62 


256009. 2. dec 


2230794H1 


6813 


7026 


62 


256009 .2.dec 


3601234H1 


6810 


7105 


62 


256009.2.dec 


218469H1 


6814 


7049 


62 


256009.2.dec 


908681 H1 


6809 


7101 


62 


256009.2.dec 


4373191 H1 


6523 


6774 


62 


256009.2.dec 


3880787H1 


6810 


7086 


62 


256009.2.dec 


871243H1 


6525 


6764 


62 


256009.2.dec 


3011719H1 


6810 


7088 


62 


256009.2.dec 


2972356H1 


6534 


6661 


62 


256009 .2.dec 


2600096H1 


6810 


7088 


62 


256009.2.dec 


463304H1 


6541 


6702 


62 


256009 .2.dec 


2924309H1 


6809 


7080 


62 


256009.2.dec 


4625565F6 


6555 


6902 


62 


256009.2.dec 


5791558H1 


6809 


7085 


62 


256009 .2.dec 


5114016H1 


6556 


6845 


62 


256009 .2.dec 


3283883H1 


6810 


7051 


62 


256009.2.dec 


3947921 H1 


6572 


6743 


62 


256009.2.dec 


4153912H1 


6809 


7076 


62 


256009.2.dec 


1682252F7 


6575 


6865 


62 


256009. 2.dec 


3047406H1 


6810 


7104 


62 


256009.2.dec 


1682252H1 


6575 


6750 


62 


256009.2.dec 


5885375H1 


6810 


7062 


62 


256009.2. dec 


1682229H1 


6575 


6757 


62 


256009.2.dec 


423981 0H1 


6810 


6989 


62 


256009.2.dec 


2932355H1 


6581 


6811 


62 


256009.2.dec 


58931 5H1 


6809 


7048 


62 


256009.2.dec 


6408071 H1 


6586 


6769 


62 


256009.2. dec 


4185609H1 


6809 


7107 


62 


256009.2.dec 


1 499581 H1 


6585 


6793 


62 


256009.2.dec 

hB X^ X^ X^ X^ • » X* X^ X^ 


2812373H1 


6809 


7072 


62 


256009.2.dec 


4362986H1 


6590 


6842 


62 


256009.2.dec 


2520870H1 


6811 


7052 


62 


256009 .2.dec 


504481 1H1 


6592 


6730 


62 


256009.2 dec 

■j^k x^ X^ X^ X^ XX » *— ■ X^ x^ x^ 


3884579H1 


6810 


7043 


62 


256009.2.dec 


3706863H1 


6592 


6885 


62 


256009.2.dec 


1845847H1 


6809 


6995 


62 


256009.2.dec 


2659266H1 


6599 


6829 


62 


256009.2.dec 


2708924H1 


6809 


7122 


62 


256009.2.dec 


4359709H1 


6599 


6852 


62 


256009.2.dec 


4046820H1 


6809 


7089 


62 


256009 ,2.dec 


537571 0H1 


6602 


6861 


62 


256009.2.dec 


1 96091 2H1 


6809 


7085 


62 


256009 .2.dec 


4109048H1 


6620 


6900 


62 


256009 .2.dec 


3049275H1 


6810 


7097 


62 


256009 .2.dec 


3740282T6 


6621 


7168 


62 


256009.2.dec 


321 5251 H1 


6810 


7100 


62 


256009 .2.dec 


1227552H1 


6624 


6832 


62 


256009.2.dec 


4365729H1 


6811 


7073 


62 


256009.2.dec 


1263359H1 


6641 


6909 


62 


256009 .2.dec 


4117120H1 


6814 


6975 


62 


256009.2.dec 


1263359R1 


6642 


7181 


62 


256009.2.dec 


g21 94573 


6811 


7181 


62 


256009 .2.dec 


1601346T6 


6643 


7179 


62 


256009 .2.dec 


5091046H1 


6811 


7076 


62 


256009.2.dec 


5286950H1 


6645 


6822 


62 


256009.2.dec 


1857590H1 


6811 


7073 


62 


256009 .2.dec 


6587440H1 


6645 


7200 


62 


256009 .2.dec 


1695556H1 


6811 


7024 


62 


256009.2.dec 


3988491 H1 


6650 


6925 


62 


256009.2.dec 


869832H1 


6811 


7070 


62 


256009.2.dec 


6396383H1 


6650 


6960 


62 


256009.2.dec 


2318919H1 


6811 


7078 


62 


256009.2.dec 


2121123H1 


6651 


6897 


62 


256009.2.dec 


1693022H1 


6811 


7017 


62 


256009.2.dec 


3963377H1 


6651 


6936 


62 


256009.2.dec 


1 27381 2F1 


6812 


7215 


62 


256009.2.dec 


3965396H1 


6651 


6842 


62 


256009.2.dec 


23671 76H1 


6810 


7031 


62 


256009.2.dec 


5281655H1 


6658 


6873 


62 


256009.2.dec 


824777H1 


6811 


7127 


62 


256009.2.dec 


3170987H1 


6666 


6925 


62 


. 256009.2.dec 


31 14091 H1 


6810 


7084 


62 


256009.2.dec 


1260390H1 


6666 


6901 


62 


256009 .2.dec 


824777T1 


6811 


7173 


62 


256009. 2. dec 


5577548H1 


6668 


6923 


62 


256009.2.dec 


g2265297 


6811 


7217 


62 


256009.2.dec 


818983R1 


6671 


7223 


62 


256009 .2.dec 


704635H1 


6811 


7037 


62 


256009.2.dec 


818983T1 


6671 


7176 


62 


256009 .2.dec 


5090855H1 


6811 


7086 


62 


256009. 2.dec 


818983H1 


6671 


6859 


62 


256009.2.dec 


1496555H1 


6811 


7034 


62 


256009.2.dec 


2049005H1 


6672 


6955 
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62 


256009.2.dec 


870733H1 


6672 


6917 


62 


256009.2.dec 


1301095H1 


6800 


7056 


62 


256009.2.dec 


990641 H1 


6674 


6982 


62 


256009.2.dec 


3389837H1 


6800 


7061 


62 


256009.2.dec 


947048H1 


6677 


6832 


62 


256009.2-dec 


422981 1H1 


6800 


7084 


62 


256009.2.dec 


206785H1 


6677 


6908 


62 


256009.2.dec 


g 1365272 


6801 


7224 


62 


256009.2.dec 


g 1506961 


6694 


6872 


62 


256009 .2.dec 


933491T1 


6800 


7177 


62 


256009.2.dec 


5006831 H1 


6694 


6807 


62 


256009.2.dec 


1955431 H1 


6800 


7075 


62 


256009. 2. dec 


369971 3H1 


6694 


6856 


62 


256009.2.dec 


22651 55H1 


6801 


7070 


62 


256009.2.dec 


4824777H1 


6694 


6925 


62 


256009.2.dec 


3352558H1 


6801 


7093 


62 


256009.2.dec 


4407873H1 


6694 


6956 


62 


256009.2.dec 


4242674H1 


6800 


6987 


62 


256009.2.dec 


3706839H1 


6696 


6968 


62 


256009.2.dec 


555446H1 


6802 


7043 


62 


256009.2.dec 


663677H1 


6696 


6935 


62 


256009.2.dec 


933491 H1 


6800 


7072 


62 


256009.2.dec 


1456470H1 


6698 


6972 


62 


256009.2.dec 


2972455H2 


6801 


7098 


62 


256009.2.dec 


968683H1 


6703 


6976 


62 


256009.2.dec 


3726151 H1 


6802 


6923 


62 


256009.2.dec 

XX XX XX XX m € • XJ XX XX 


6221267H1 


6704 


6979 


62 


256009.2.dec 


3369242H1 


6802 


7046 


62 


256009 2 dec 

^» X» XX xx X^ « ^» ■ X^ XX *X 


20991 29H1 

^X XX XX • XX • ■ ■ 


6706 


6960 


62 


256009 .2.dec 


4880939H1 


6802 


7080 


62 


256009.2.dec 


2429628H1 


6706 


6939 


62 


256009.2.dec 


953686R1 


6802 


7215 


62 


256009.2.dec 


895220R1 


6707 


7113 


62 


256009.2.dec 


1445580H1 


6802 


7043 


62 


256009.2.dec 


895220H1 


6707 


6926 


62 


256009.2.dec 


2530970H1 


6802 


7032 


62 


256009.2.dec 


2971992H1 


6709 


7001 


62 


256009.2.dec 


2639476H1 


6802 


7050 


62 


256009.2.dec 


2354602H1 


6713 


6935 


62 


256009.2.dec 


953686H1 


6802 


7045 


62 


256009.2.dec 


3441955H1 


6713 


6959 


62 


256009.2.dec 


4507544H1 


6802 


7079 


62 


256009.2.dec 


3704091 H1 


6713 


6980 


62 


256009.2.dec 


37261 82H1 


6802 


7055 


62 


256009.2.dec 


3705591 H1 


6714 


6991 


62 


256009 .2.dec 


g1 390836 


6803 


7209 


62 


256009.2.dec 


3481169H1 


6714 


6854 


62 


256009. 2. dec 


4815838H1 


6802 


7057 


62 


256009.2.dec 


2647507H1 


6717 


6971 


62 


256009.2.dec 


2508350H1 


6803 


7057 


62 


256009 .2.dec 


2260986H1 


6722 


6943 


62 


256009.2.dec 


2061029H1 


6802 


7061 


62 


256009.2.dec 


2366005H1 


6723 


6956 


62 


256009.2.dec 


2910492H1 


6803 


7052 


62 


256009.2.dec 


344692H1 


6730 


6929 


62 


256009.2.dec 


2061029R6 


6802 


7124 


62 


256009.2.dec 


4629622H1 


6730 


6990 


62 


256009.2.dec 


g51 12634 


6803 


7218 


62 


256009.2.dec 


4625565T6 


6751 


7201 


62 


256009.2.dec 


3282530H1 


6804 


7046 


62 


256009.2.dec 


4383604H1 


6755 


7001 


62 


256009.2.dec 


48791 37H1 


5767 


6051 


62 


256009.2.dec 


385999H1 


6763 


7029 


62 


256009.2.dec 


2579142H1 


5789 


6048 


62 


256009.2.dec 


2913295H2 


6764 


7035 


62 


256009.2.dec 


3286303H2 


5797 


6063 


62 


256009.2.dec 


478883H1 


6764 


7049 


62 


256009.2.dec 


3405455H1 


5802 


6041 


62 


256009.2.dec 


3287375H1 


6767 


7010 


62 


256009.2.dec . 


6521704H1 


5808 


5885 


62 


256009.2.dec 


6311868H1 


6767 


7230 


62 


256009.2.dec 


1986283H1 


5822 


6032 


62 


256009.2.dec 


29031 20H1 


6765 


7068 


62 


256009.2.dec 


2911956H1 


5829 


6113 


62 


256009.2.dec 


6484690H1 


6769 


7215 


62 


256009.2.dec 


1 40391 3H1 


5833 


6084 


62 


256009.2.dec 


1 72741 8H1 


6779 


6987 


62 


256009.2.dec 


5437246H1 


5842 


6082 


62 


256009.2.dec 


4654772H1 


6780 


7027 


62 


256009.2.dec 


6492553H1 


5851 


6377 


62 


256009.2.dec 


666300H1 


6787 


6997 


62 


256009.2.dec 


3025734H1 


5852 


6097 


62 


256009.2.dec 


g921826 


6789 


7192 


62 


256009.2.dec 


445691 0H1 


5865 


6130 


62 


256009.2.dec 


4351905H1 


6788 


6948 


62 


256009.2.dec 


5093151 H1 


5870 


6132 


62 


256009.2.dec 


1596705H1 


6789 


7011 


62 


256009.2.dec 


3423758H1 


5871 


6126 


62 


256009.2.dec 


29981 88H1 


6789 


7046 


62 


256009.2.dec 


3761955H1 


5880 


6181 


62 


256009 ,2.dec 


2998354H1 


6789 


7047 


62 


256009.2.dec 


5301333H1 


5882 


6083 


62 


256009.2.dec 


5450729H1 


6789 


7059 


62 


256009.2.dec 


6380485H1 


5882 


6153 


62 


256009.2.dec 


1406482H1 


6789 


7024 


62 


256009.2.dec 


284271 0H1 


5882 


6164 


62 


256009.2.dec 


g864197 


6793 


7137 


62 


256009.2.dec 


4138977H1 


5885 


6162 


62 


256009.2.dec 


1880440H1 


6794 


7039 


62 


256009.2.dec 


435351 8H1 


5887 


6065 


62 


256009.2.dec 


g995262 


6795 


7200 


62 


256009.2.dec 


504891 5H1 


5897 


6153 


62 


256009.2.dec 


g1 8471 40 


6795 


7184 


62 


256009.2.dec 


4874435H1 


5903 


6170 


62 


256009.2.dec 


532891 2H1 


6797 


7039 


62 


256009.2.dec 


2552601 H1 


5913 


6166 


62 


256009.2.dec 


g2005208 


6798 


7103 


62 


256009.2.dec 


2970306H2 


5921 


6239 


62 


256009.2.dec 


862304T1 


6797 


7167 


62 


256009.2.dec 


209561 1H1 


5922 


6188 


62 


256009.2.dec 


4382949H1 


6798 


7052 


62 


256009.2.dec 


2253545H1 


5921 


6158 


62 


256009.2.dec 


1209525T1 


6797 


7174 


62 


256009.2.dec 


3404668H1 


5922 


6161 


62 


256009.2.dec 


545331 H1 


6796 


7032 


62 


256009.2.dec 


3886564H1 


5927 


6163 


62 


256009. 2.dec 


407031 8H1 


6801 


6924 


62 


256009.2.dec 


5550359H1 


5929 


6178 


62 


256009 .2.dec 


567575H1 


6799 


7058 


62 


256009.2.dec 


366241 H1 


5930 


6121 


62 


256009.2.dec 


21 80691 H1 


6799 


7054 


62 


256009.2.dec 


5173270H1 


5948 


6224 


62 


256009. 2. dec 


1559402H1 


6799 


7018 


62 


256009.2.dec 


3281852H1 


5965 


6211 


62 


256009.2.dec 


3487562H1 


6799 


7084 


62 


256009 -2.dec 


3678857H1 


5980 


6209 


62 


256009.2.dec 


g5397606 


6799 


7217 


62 


256009.2.dec 


3321667H1 


5992 


6276 


62 


256009.2.dec 


5182883H1 


6800 


6959 


62 


256009.2.dec 


2095264H1 


5996 


6272 


62 


256009.2.dec 


5478393H1 


6799 


7079 


62 


256009.2.dec 


4361 881 H1 


5999 


6249 


62 


256009. 2.dec 


933491 R1 


6800 


7215 


62 


256009-2.dec 


4357478H1 


6014 


6094 
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62 


256009. 2.dec 


3688656H1 


6020 


6310 


62 


256009.2.dec 


3172462H1 


6411 


6614 


62 


256009. 2. dec 


2790077H2 


6025 


6309 


62 


256009.2.dec 


425389H1 


6411 


6553 


62 


256009.2.dec 


552252H1 


6038 


6271 


62 


256009.2.dec 


1334229H1 


6411 


6630 


62 


256009.2.dec 


1601346F6 


6038 


6367 


62 


256009 .2.dec 


2187059H1 


6411 


6646 


62 


256009.2.dec 


1601346H1 


6038 


6247 


62 


256009.2.dec 


14501 13H1 


6411 


6609 


62 


256009.2.dec 


214224H1 


6053 


6232 


62 


256009.2.dec 


425476H1 


6411 


6562 


62 


256009.2.dec 


5466354H1 


6056 


6319 


62 


256009.2.dec 


4274885H1 


6411 


6606 


62 


256009.2.dec 

■jm XX XX X^ X^ XX « • »^ X^ 


4574252H1 


6065 


6322 


62 


256009.2.dec 


4551 57H1 


6411 


6632 


62 


256009.2 dec 

XX XX X^ XX xx »w * XX XX XX 


5505467H1 

XX XX XX XX XX P 1 » 1 


6070 


6309 


62 


256009.2.dec 


4359206H1 


6420 


6698 


62 


256009 2 dec 


62461 93H1 


6091 


6605 


62 


256009.2.dec 


4181068H1 


6422 


6691 


62 


256009 2 dec 


453981 7H1 

»^xx^Wx#Vx (flit 


6114 


6250 


62 


256009.2.dec 


63251 61 H1 


6424 


6643 


62 


256009 2 dec 


641 1666H1 


6119 


6560 


62 


256009.2.dec 


2310850H1 


6424 


6685 


62 

W£_ 


256009 2 dec 


6372383H1 

VVl fc»\xxxwl » ■ 


6125 


6313 


62 


256009.2.dec 


26721 63H1 


6430 


6671 


62 


256009 2 dec 


42741 5H1 


6138 


6336 


62 


256009.2.dec 


3984431 H1 


6433 


6712 


62 


256009 2 dec 


2121658H1 

t_ I £v I WWW! t 1 


6152 


6384 


62 


256009.2.dec 


5175841H1 


6434 


6707 


62 


256009 2 dec 


5333389 H1 

w Wwwwvi • ■ 


6152 


6373 


62 


256009.2.dec 


3740961 H1 


6436 


6744 


62 


256009 2 dec 


1 726084 H1 

■ 9 *^WW~I • ■ 


6153 


6374 


62 


256009.2.dec 


3528730H1 


6437 


6712 


62 


256009 2 dec 


1 726084F6 


6153 


6674 


62 


256009.2.dec 


2250643H1 


6439 


6695 


62 

W £_ 


256009 2 dec 


4291 681 H1 


6154 


6329 


62 


256009.2.dec 


911369H1 


6457 


6580 


62 


256009 2 dec 


387731 H1 

WW f » XX 1 1 1 1 


6156 


6384 


62 


256009.2-dec 


2782975H1 


6458 


6717 


62 


256009 2 dec 

JUwwVWW - ^— • XX XX XX 


446703H1 


6156 


6384 


62 


256009.2.dec 


g2156125 


6454 


6864 


62 


256009 2 dec 

a— XX XX XX XX NX • g — XX 


387605 H1 

XX XX ¥ XX xx x^ ■ ■ ■ 


6158 


6384 


62 


256009.2.dec 


4645746H1 


6459 


6725 


62 


256009 2 dec 

bWV XX XX • • XX XX XX 


4171129H1 

W WW i • > XX » ■ ■ 


6159 


6384 


62 


256009.2.dec 


37901 44H1 


6462 


6683 


62 


256009 .2. dec 


71 2351 H1 


6166 


6378 


62 


256009.2.dec 


3098026H1 


6464 


6766 


62 


256009 .2 .dec 

IwVW V VV X^ X^ X* 


4895530H1 


6168 


6467 


62 


256009.2.dec 


3786369H1 


6464 


6752 


62 


256009. 2. dec 

XX XX XX XX X^ ■ » X* x^ x^ 


2840332H1 


6169 


6384 


62 


256009.2.dec 


g21 11938 


6463 


6898 


62 


256009.2.dec 


6493278H1 


6178 


6700 


62 


256009.2.dec 


43021 40H1 


6491 


6732 


62 


256009.2.dec 


460821 0H1 


6185 


6445 


62 


256009.2.dec 


3441270H1 


6511 


6751 


62 


256009.2.dec 


4325384H1 


6195 


6312 


62 


256009.2.dec 


g2207135 


6510 


7002 


62 


256009 2.dec 

XX XX XX X* XX » * ■ »X>* XX XX 


287961 4H1 


6197 


6499 


62 


256009.2.dec 


4881155H1 


6519 


6776 


62 


256009 2 dec 

b_ XX XX XX XX XX « b» • XX XX XX 


3884625H1 

WW 1 WX» W ' ■ • 


6203 


6467 


62 


256009.2.dec 


360503H1 


2194 


2455 


62 


256009.2.dec 


4355435H1 


6206 


6384 


62 


256009 .2.dec 


g2356298 


2458 


2825 


62 


256009 2. dec 

b» XX XX XX X* ■ fc» * XXi xxxx 


5136475H1 


6208 


6491 


62 


256009. 2.dec 


5059091 H1 


2496 


2760 


62 


256009 2 dec 

»-WfV WW ■ to* • XX XX XX 


3724538H1 

W w to_ F XXXX XX ■ • 9 


6214 


6509 


62 


256009.2.dec 


g 178282 


2535 


6589 


62 


256009 2 dec 

to- XX XX XX X^ XX »fc*<VtVV 


4881090H1 


6217 


6481 


62 


256009 .2.dec 


5629802H1 


2775 


2913 


62 


256009 2 dec 

bVWWW ■ to * XX XX Xf 


5134096H1 

XX w XX * XX XX XX ■ ■ ■ 


6231 


6484 


62 


256009.2.dec 


4778995H1 


2765 


3088 


62 


256009 2 dec 


960344H1 

w www ■ » » ■ ■ 


6237 


6384 


62 


256009 .2.dec 


g2021107 


2806 


3052 


62 

w^. 


256009 2 dec 


3426474H1 


6242 


6509 


62 


256009 .2.dec 


1521307H1 


2945 


3137 


62 


256009 2 dec 


a 1634324 

\J i xx xx rwc i 


6254 


6620 


62 


256009 .2.dec 


1521719H1 


2945 


3115 


62 


256009 2 dec 


3682063H1 

Wx*w wwi ■ ■ 


6255 


6370 


62 


256009.2.dec 


2786057H1 


2945 


3189 


62 


256009 2 dec 


4150782H1 

v P WW f W * — ■ V ■ 


6262 


6505 


62 


256009 .2.dec 


6488485H1 


3028 


3528 


62 


256009 2 dec 


3774534H1 

xx m ff r WW r w ■ ■ 


6263 


6545 


62 


256009.2.dec 


5637372H1 


3032 


3280 


62 


256009 2 dec 

ww WW xx W iL-UW Vx 


6409346H1 

V ■ WW t w» ■ ■ 


6264 


6584 


62 


256009.2.dec 


5637260H1 


3032 


3294 


62 


256009 2 dec 

£ XXXx 1 XX XX XX^ » £— • VJ WW 


2611942H1 

ImW V W W ■ ^— t » ■ 


6285 


6537 


62 


256009.2.dec 


5518962H1 


3096 


3354 


62 


256009 2 dec 

t»VV\i VW * tor - XX X** XX 


786520H1 


6288 


6498 


62 


256009.2.dec 


4454257H1 


3101 


3275 


62 


256009 2. dec 

to_XXXXXX XXXX • to_ • XX XX XX 


4010059H1 


6288 


6559 


62 


256009.2.dec 


5642023H1 


3263 


3497 


62 


256009 2 dec 

m VW XX SX ■ b ■ XXXX XX 


965650H1 

XX X^ XX XX XX XX 1 9 ■ 


6295 


6384 


62 


256009.2.dec 


1 599061 H1 


3274 


3482 


62 


256009 .2 .dec 


4327206H1 


6301 


6566 


62 


256009.2.dec 


5155543H1 


3292 


3557 


62 


256009 .2dec 


762834H1 


6310 


6491 


62 


256009.2.dec 


6078401 H1 


3318 


3615 


62 


256009.2.dec 


3235055H1 


6313 


6560 


62 


256009.2.dec 


5847863H1 


3323 


3578 


62 


256009.2.dec 

*■ XX XX XX XX X^ • ■ X^ X^ X^ 


3041143H1 


6313 


6570 


62 


256009.2.dec 


5642649H1 


3408 


3666 


62 


256009.2.dec 


1814602H1 


6314 


6558 


62 


256009.2.dec 


5520057H1 


3533 


3803 


62 


256009 .2.dec 


4373041 H1 


6317 


6385 


62 


256009.2.dec 


5642541 H1 


3556 


3787 


62 


256009.2.dec 

WW XX XX XX XX » to— * XX XX XX 


5218032H1 


6316 


6573 


62 


256009.2.dec 


3068454H1 


3577 


3894 


62 


256009 2 dec 

toV/WW WW ik-WWW 


5432981 H1 

XX IW^WW WW w ■ 


6320 


6522 


62 


256009.2.dec 


3068454F6 


3577 


4018 


62 


256009 2 dec 

to_ XX XX XX XX XX i to- iW>W 


5432864H1 


6320 


6524 


62 


256009.2.dec 


6488494H1 


3590 


4142 


62 


256009 .2.dec 


4856094H1 


6321 


6571 


62 


256009.2.dec 


320001 1H1 


3616 


3894 


62 


256009.2.dec 


1907445H1 


6336 


6568 


62 


256009.2.dec 


6479761 H1 


3619 


4153 


62 


256009.2.dec 


2645473H1 


6408 


6653 


62 


256009.2.dec 


2697253H1 


3719 


4021 


62 


256009.2.dec 


424883H1 


6411 


6568 


62 


256009.2.dec 


5522487H1 


3799 


3892 


62 


256009.2.dec 


2225449H1 


6411 


6639 


62 


256009 .2.dec 


5089970H1 


3801 


3971 


62 


256009.2.dec 


1456922H1 


6411 


6564 


62 


256009.2.dec 


634881 7H1 


3807 


4089 


62 


256009.2.dec 


3047472H1 


6411 


6688 


62 


256009.2. dec 


1257838H1 


3812 


4062 


62 


256009.2.dec 


5069778H1 


6411 


6661 


62 


256009.2.dec 


5043875H1 


3935 


4190 


62 


256009.2.dec 


2855974H1 


6411 


6610 


62 


256009.2.dec 


960370H1 


4046 


4247 


62 


256009 .2.dec 


2102283H1 


6411 


6638 


62 


256009.2.dec 


5438883H1 


4065 


4315 


62 


256009.2.dec 


427953H1 


6411 


6565 


62 


256009.2.dec 


3600706H1 


4135 


4355 
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62 


256009.2.dec 


3441916H1 


4173 


4434 


62 


256009.2.dec 


6077457H1 


5478 


5788 


62 


256009.2.dec 


5065557H2 


4193 


4446 


62 


256009.2.dec 


6077468H1 


5479 


5780 


62 


256009.2.dec 


640741 0H1 


4199 


4494 


62 


256009.2.dec 


g625007 


5478 


5734 


62 


256009.2.dec 


5644707R8 


4205 


4599 


62 


256009.2.dec 


2631148H1 


5481 


5614 


62 


256009.2-dec 


335241 4H1 


4225 


4337 


62 


256009.2.dec 


5638339H1 


5486 


5685 


62 


256009. 2. dec 


638961 0H1 


4230 


4378 


62 


256009.2.dec 


3203663H1 


5509 


5729 


62 


256009.2.dec 


5640341 H1 


4355 


4601 


62 


256009.2.dec 


32091 49H1 


5525 


5819 


62 


256009.2.dec 


2643454H1 


4375 


4621 


62 


256009.2-dec 


1673237H1 


5546 


5781 


62 


256009.2.dec 


32851 15H1 


4383 


4619 


62 


256009.2-dec 


5043372H1 


5552 


5800 


62 


256009.2.dec 


3432281 H1 


4401 


4635 


62 


256009.2-dec 


476250H1 


5565 


5817 


62 


256009.2.dec 


6113754H1 


4434 


4655 


62 


256009 .2.dec 


6513254H1 


5572 


5833 


62 


256009.2.dec 


2182372H1 


4449 


4704 


62 


256009 .2.dec 


5000485H1 


5576 


5838 


62 


256009.2.dec 


3529034H1 


4498 


4802 


62 


256009 .2.dec 


4820523H1 


5577 


5863 


62 


256009.2.dec 


6491824H1 


4541 


4783 


62 


256009 .2.dec 


28531 17H1 


5580 


5851 


62 


256009.2.dec 


6483681 H1 


4541 


5017 


62 


256009.2.dec 


5512281 H1 


5582 


5818 


62 


256009.2.dec 


2124605H1 


4574 


4874 


62 


256009 .2.dec 


899007H1 


5585 


5802 


62 


256009.2.dec 


2122823H1 


4602 


4882 


62 


256009 .2.dec 


3425387H1 


5588 


5824 


62 


256009 .2.dec 


3788796H1 


4634 


4909 


62 


256009.2.dec 


4902802H1 


5629 


5914 


62 


256009 .2.dec 


5159083H1 


4641 


4795 


62 


256009 .2.dec 


240687H1 


5640 


5819 


62 


256009.2.dec 


g610938 


4663 


5048 


62 


256009 .2.dec 


5642467H1 


5660 


5918 


62 


256009 .2.dec 


1517151H1 


4700 


4896 


62 


256009 .2.dec 


4516914H1 


5690 


5942 


62 


256009 .2-dec 


4284867H1 


4706 


5020 


62 


256009.2.dec 


5173253H1 


5691 


5967 


62 


256009.2-dec 


6076896H1 


4721 


5013 


62 


256009 .2.dec 


328511 OH 1 


5702 


5938 


62 


256009.2.dec 


4351039H1 


4776 


5046 


62 


256009.2.dec 


4254424H1 


5723 


6000 


62 


256009 .2.dec 


2162143H1 


4776 


4941 


62 


256009.2.dec 


5510107H1 


5725 


5956 


62 


256009.2-dec 


4820360H1 


4778 


5045 


62 


256009.2-dec 


29431 37H1 


5727 


6029 


62 


256009.2.dec 


34261 67H1 


4791 


4910 


62 


256009,2.dec 


1556574H1 


5729 


5934 


62 


256009,2.dec 


5522528H1 


4867 


5123 


62 


256009 .2.dec 


1603021 H1 


5754 


5956 


62 


256009.2-dec 


3424683H1 


4873 


4958 


62 


256009.2.dec 


g1 522250 


5759 


5996 


62 


256009.2.dec 


2923879H1 


4883 


5148 


62 


256009 .2.dec 


4624485H1 


5766 


6004 


62 


256009.2.dec 


g819017 


4890 


5068 


62 


256009.2.dec 


g21 11876 


6849 


7223 


62 


256009.2.dec 


1431243H1 


4899 


5143 


62 


256009.2.dec 


g2718730 


6849 


7222 


62 


256009 ,2.dec 


6523601 H1 


4945 


5442 


62 


256009.2.dec 


g 1023224 


6849 


7119 


62 


256009.2.dec 


3740282H1 


4959 


5248 


62 


256009.2.dec 


849037R1 


6849 


7209 


62 


256009.2.dec 


3740282F6 


4959 


5255 


62 


256009.2.dec 


4371 46H1 


6849 


7059 


62 


256009.2.dec 


2805285H1 


4974 


5257 


62 


256009.2.dec 


41 89371 H1 


6848 


7204 


62 


256009.2.dec 


3725454H1 


4989 


5263 


62 


256009.2.dec 


918287H1 


6849 


7127 


62 


256009.2-dec 


3126237H1 


5022 


5297 


62 


256009 .2.dec 


713272H1 


6850 


6932 


62 


256009.2.dec 


1981 13H1 


5034 


5216 


62 


256009.2.dec 


g2584215 


6850 


7215 


62 


256009 .2.dec 


5088534H1 


5057 


5320 


62 


256009 .2.dec 


g3741328 


6850 


7216 


62 


256009.2-dec 


3379972H1 


5061 


5303 


62 


256009.2-dec 


24841 09H1 


6850 


7178 


62 


256009 .2.dec 


g707978 


5088 


5372 


62 


256009.2.dec 


g3232181 


6851 


7224 


62 


256009.2.dec 


5344949H1 


5097 


5270 


62 


256009.2-dec 


g663701 


6852 


7155 


62 


256009.2.dec 


3423665H1 


5108 


5384 


62 


256009.2-dec 


3836834H1 


6853 


7125 


62 


256009.2-dec 


3095983H1 


5112 


5405 


62 


256009.2-dec 


6552959H1 


6858 


7262 


62 


256009 .2.dec 


649021 OH 1 


5138 


5484 


62 


256009.2.dec 


1973258H1 


6860 


7109 


62 


256009.2.dec 


3761375H1 


5154 


5367 


62 


256009.2.dec 


6372054H1 


6861 


7143 


62 


256009.2.dec 


3394437H1 


5161 


5422 


62 


256009 .2.dec 


49071 79H2 


6863 


7127 


62 


256009.2.dec 


4519489H1 


5182 


5435 


62 


256009.2.dec 


g3840340 


6888 


7219 


62 


256009.2.dec 


3200302H1 


5208 


5498 


62 


256009 .2.dec 


214414H1 


6863 


7111 


62 


256009.2-dec 


4353128H1 


5222 


5444 


62 


256009.2.dec 


2718318H1 


6864 


7111 


62 


256009.2.dec 


6485349H1 


5249 


5819 


62 


256009 -2.dec 


280301 1H1 


6864 


7113 


62 


256009 .2.dec 


4324278H1 


5256 


5504 


62 


256009 .2.dec 


g2279342 


6867 


7215 


62 


256009 .2.dec 


4788337H1 


5265 


5512 


62 


256009.2.dec 


g659626 


6868 


7219 


62 


256009.2.dec 


3424089H1 


5265 


5492 


62 


256009-2.dec 


611772H1 


6868 


7115 


62 


256009 .2.dec 


5201949H1 


5290 


5479 


62 


256009.2.dec 


g4573987 


6873 


7215 


62 


256009.2.dec 


5136679H1 


5302 


5585 


62 


256009.2.dec 


g1 6791 20 


6874 


7219 


62 


256009.2.dec 


3740789H1 


5330 


5547 


62 


256009.2.dec 


6296239H1 


6874 


7072 


62 


256009.2.dec 


3360092H1 


5356 


5620 


62 


256009.2.dec 


g5362892 


6876 


7223 


62 


256009 .2.dec 


5349521 H1 


5364 


5600 


62 


256009.2.dec 


g1 390671 


6876 


7123 


62 


256009.2.dec 


2399428H1 


5373 


5607 


62 


256009.2.dec 


g5056412 


6877 


7214 


62 


256009 .2.dec 


3227657H1 


5381 


5682 


62 


256009.2.dec 


g2821748 


6877 


7059 


62 


256009.2.dec 


4459036H1 


5383 


5635 


62 


256009 .2.dec 


640831 5H1 


6879 


7231 


62 


256009 .2. dec 


2844896H1 


5406 


5634 


62 


256009.2.dec 


258401 3H1 


6879 


7129 


62 


256009.2.dec 


2845249H1 


5406 


5677 


62 


256009.2.dec 


g5547977 


6881 


7215 


62 


256009.2.dec 


4949540H1 


5412 


5661 


62 


256009 .2.dec 


4820791 H1 


6880 


7149 


62 


256009.2.dec 


4923246H1 


5426 


5667 


62 


256009.2.dec 


2541637H1 


6880 


7108 
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62 


256009.2.dec 


g5233772 


6881 


7220 


62 


256009.2.dec 


5339947H1 


6972 


7215 


62 


256009.2.dec 


g2002728 


6882 


7215 


62 


256009.2.dec 


8531 97H1 


6973 


7215 


62 


256009.2.dec 


6403535H1 


6884 


7170 


62 


256009.2.dec 


g2211699 


6974 


7219 


62 


256009.2.dec 


2757642H1 


6884 


7141 


62 


256009.2.dec 


4771 63H1 


6975 


7213 


62 


256009.2.dec 


g3431012 


6885 


7215 


62 


256009.2.dec 


3519575H1 


6975 


7216 


62 


256009.2.dec 


2061029T6 


6885 


7175 


62 


256009.2.dec 


571449H1 


6975 


7220 


62 


256009. 2.dec 


1456470R1 


6886 


7215 


62 


256009. 2.dec 


4418748H1 


6975 


7214 


62 


256009. 2.dec 


g5547986 


6887 


7215 


62 


256009.2.dec 


2024855H1 


6982 


7215 


62 


256009.2.dec 


g4089043 


6887 


7219 


62 


256009.2.dec 


g1 8471 39 


6983 


7215 


62 


256009.2.dec 


2299044H1 


6888 


7123 


62 


256009. 2. dec 


g5635569 


6987 


7215 


62 


256009. 2.dec 


231439F1 


6889 


7214 


62 


256009.2.dec 


621 9701 H1 


6989 


7194 


62 


256009.2.dec 


231439H1 


6889 


7056 


62 


256009 .2.dec 


1998187H1 


6992 


7265 


62 


256009.2.dec 


g2194515 


6890 


7215 


62 


256009.2.dec 


1297531 H1 


6996 


7220 


62 


256009. 2. dec 


g4565079 


6891 


7215 


62 


256009. 2. dec 


668356H1 


6996 


7212 


62 


256009.2.dec 


2646878H1 


6892 


7149 


62 


256009. 2. dec 


1 297531 F1 


6996 


7215 


62 


256009.2.dec 


g51 52227 


6893 


7215 


62 


256009.2.dec 


2022823H1 


6996 


7215 ■ 


62 


256009. 2. dec 


g3958452 


6893 


7215 


62 


256009.2.dec 


936753H1 


6996 


7215 


62 


256009.2.dec 


4081 731 H1 


6894 


7181 


62 


256009.2.dec 


4764627H1 


6997 


7213 


62 


256009. 2. dec 


916243H1 


6896 


7214 


62 


256009.2.dec 


1251487H1 


7000 


7215 


62 


256009.2.dec 


g3446745 


6895 


7223 


62 


256009.2.dec 


2047571 H1 


7000 


7204 


62 


256009.2.dec 


916243T1 


6896 


7172 


62 


256009.2.dec 


1251487F1 


7000 


7215 


62 


256009.2.dec 


6566649H1 


6900 


7215 


62 


256009. 2.dec 


5054809H1 


7002 


7286 


62 


256009.2.dec 


916243R1 


6896 


7214 


62 


256009.2dec 


5056688H1 


7002 


7217 


62 


256009.2.dec 


g3016042 


6897 


7215 


62 


256009.2.dec 


6483288H1 


1 


470 


62 


256009.2.dec 


1844640H1 


6897 


7172 


62 


256009.2.dec 


g4196104 


362 


769 


62 


256009.2.dec 


g1 166675 


6897 


7336 


62 


256009.2.dec 


5643206H1 


425 


684 


62 


256009.2.dec 


g2224108 


6897 


7215 


62 


256009.2.dec 


g2264001 


436 


849 


62 


256009.2.dec 


4370277H1 


6898 


7136 


62 


256009.2.dec 


6487378H1 


463 


965 


62 


256009.2.dec 


630279H1 


6898 


7147 


62 


256009.2.dec 


g4330828 


485 


734 


62 


256009.2.dec 


4367694H1 


6898 


7145 


62 


256009.2.dec 


4705323H1 


541 


794 


62 


256009.2.dec 


641 4051 H1 


6900 


7222 


62 


256009.2.dec 


3072025F6 


600 


886 


62 


256009.2.dec 


4921996H1 


6898 


7168 


62 


256009.2.dec 


3072025H1 


601 


891 


62 


256009.2.dec 


3201114H1 


6908 


7187 


62 


256009.2.dec 


5090601 H1 


642 


921 


62 


256009.2.dec 


148475H1 


6909 


7122 


62 


256009 .2.dec 


2629979H1 


647 


890 


62 


256009.2.dec 


g4986417 


6913 


7215 


62 


256009.2.dec 


g535176 


650 


2250 


62 


256009.2.dec 


g5037266 


6917 


7217 


62 


256009.2.dec 


5642794H1 


762 


1011 


62 


256009.2.dec 


2295604H1 


6917 


7132 


62 


256009.2.dec 


6430548H1 


909 


1280 


62 


256009.2.dec 


g995210 


6925 


7208 


62 


256009.2.dec 


563791 8H1 


1012 


1271 


62 


256009.2.dec 


2909721 H1 


6925 


7193 


62 


256009.2.dec 


5868585H1 


1118 


1379 


62 


256009.2.dec 


647214H1 


6926 


7178 


62 


256009 .2.dec 


5631903H1 


1120 


1340 


62 


256009.2.dec 


1963727H1 


6926 


7208 


62 


256009.2dec 
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C_ X^ I V*' ■ ■ • ■ * x** x.* xx 


g5591239 


1368 


1583 


63 


231892.12.dec 


3537450H1 


257 


548 


63 


231 892.1 2.dec 


g5591232 


1369 


1583 


63 


231 892.1 2.dec 


3365926H1 


259 


491 


63 


231892 12 dec 

4_— XX I XXW* — - * »-» - X^ XX 


a3959585 


1372 


1591 


63 


231 892.1 2.dec 


713333H1 


259 


485 


63 


231892 12 dec 


02229948 


1374 


1583 


63 


231 892.1 2.dec 


2444890H1 


259 


474 


63 


231892 12 dec 

XX ■ X<* X#* W * • 


02898260 


1291 


1585 


63 


231 892.1 2.dec 


g2 165426 


256 


681 


63 


231892 12 dec 

X*/ 1 XXv£_a V 4W*XXXxX-r 


a651 121 


1378 


1563 


63 


231892.12.dec 


712519H1 


260 


317 


63 


231892 12 dec 


02115189 


1382 


1586 


63 


231 892.1 2.dec 


5941877H1 


262 


409 


63 


231892 12 dec 


01124380 


1293 


1585 


63 


231 892.1 2-dec 


3341189H1 


270 


528 


63 


231892 12 dec 


O708377 

V| * x^ vx %x * * 


1388 


1589 


63 


231892.12.dec 


2849094H1 


272 


527 


63 


231892 12 dec 


3344031 H1 


1392 


1580 


63 


231892. 12.dec 


35031 42H1 


273 


555 


63 


231892 12 dec 


g3839206 


1292 


1595 


63 


231892.12.dec 


1565396H1 


277 


483 


63 


231 892.1 2.dec 


g2229771 


1394 


1584 


63 


231892.12.dec 


3762075H1 


278 


564 


63 


231 892.1 2.dec 


g2877306 


1398 


1583 


63 


231892.12.dec 


1418549H1 


279 


453 


63 


231 892.1 2.dec 


546963H1 


1292 


1571 


63 


231892.12.dec 


1418090H1 


279 


528 


63 


231892 12.dec 


g4452672 


1295 


1583 


63 


231892.12.dec 


1881336F6 


281 


712 


63 


231892.12 dec 


g1 148108 


1296 


1589 


63 


231892.12.dec 


1418926H1 


692 


926 


63 


231892.12.dec 


627862H1 


1401 


1583 


63 


231892.12.dec 


g2558192 


692 


778 


63 


231 892.1 2.dec 


g 1833004 


1298 


1566 


63 


231892.12.dec 


1647830H1 


698 


899 


63 


231 892.1 2.dec 


2921838H1 


1298 


1569 


63 


231 892.1 2.dec 


27461 36H1 


699 


933 


63 


231892 12 dec 


01201653 


1298 


1584 


63 


231892.12.dec 


1299324H1 


700 


921 


63 


231892 12 dec 


g2054698 


1301 


1583 


63 


231892. 12.dec 


880929H1 


778 


896 


63 


231 892.1 2.dec 


g1 644781 


1301 


1607 


63 


231892.12.dec 


g21 63948 


786 


1137 


63 


231 892.1 2.dec 


5048046R6 


1307 


1582 


63 


231892.12.dec 


2901409H1 


863 


1167 


63 


231 892.1 2.dec 


2880459H1 


1312 


1583 


63 


231892.12.dec 


2458906H1 


380 


598 


63 


231892.12.dec 


g1 628778 


1312 


1594 


63 


231892.12.dec 


5908231 H1 


386 


584 


63 


231 892.1 2.dec 


g1 161956 


1316 


1588 


63 


231892.12.dec 


2662336H1 


387 


620 


63 


231 892.1 2.dec 


g21 63878 


1316 


1583 


63 


231892.12.dec 


3800385H1 


390 


651 


63 


231 892.1 2.dec 


g3803279 


1316 


1583 


63 


231892.12.dec 


g1 61 6396 


394 


663 


63 


231892.12.dec 


g3213517 


1320 


1588 


63 


231892.12.dec 


2637438H1 


405 


638 


63 


231 892.1 2.dec 


3537729H1 


30 


198 


63 


231892.12.dec 


491 0861 H2 


426 


687 


63 


231 892.1 2.dec 


1466792H1 


32 


206 


63 


231892.12.dec 


863497H1 


426 


675 


63 


231 892.1 2.dec 


g1 920921 


40 


492 


63 


231892.12.dec 


1959872H1 


429 


697 
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63 


231 892.1 2.dec 


2779339 H1 


432 


678 


63 


231892.12.dec 


g3038947 


1466 


1583 


63 


231 892.1 2.dec 


292531 3H1 


456 


725 


63 


231892.12.dec 


g2411416 


1469 


1585 


63 


231892.12.dec 


1271944H1 


464 


708 


63 


231892.12.dec 


g4874602 


1471 


1588 


63 


231892.12.dec 


1232955H1 


464 


685 


63 


231892. 12.dec 


g3148716 


1475 


1583 


63 


231 892.1 2.dec 


3223737H1 


469 


777 


63 


231 892.1 2.dec 


g3042883 


1474 


1583 


63 


231892.1 2.dec 


1 48471 2H1 


478 


638 


63 


231892.12.dec 


3745735H1 


1477 


1631 


63 


231 892.1 2.dec 


3695508H1 


493 


774 


63 


231892.12.dec 


g3110113 


1479 


1584 


63 


231892.1 2.dec 


350061 1H1 


493 


773 


63 


231 892.1 2.dec 


g3039527 


1483 


1587 


63 


231 892.1 2.dec 


3349768H1 


499 


765 


63 


231892.12.dec 


3590479H1 


1489 


1584 


63 


231892.1 2.dec 


1674505H1 


502 


714 


63 


231892.12.dec 


6028077H1 


1497 


1583 


63 


231 892.1 2.dec 


2382249H1 


559 


791 


63 


231 892.1 2.dec 


g1 696451 


1508 


1588 


63 


231892. 12.dec 


1375225H1 


587 


842 


63 


231 892.1 2.dec 


g2251530 


1515 


1582 


63 


231892.12.dec 


1375225F6 


587 


758 


63 


231892.12.dec 


g4389717 


1520 


1583 


63 


231 892.1 2.dec 


3190172H1 


635 


950 


63 


231 892.1 2.dec 


g3055876 


1521 


1583 


63 


231892.12.dec 


1574338H1 


639 


849 


63 


231 892.1 2.dec 


1881336H1 


281 


497 


63 


231892. 12.dec 


g1 189551 


667 


917 


63 


231 892.1 2.dec 


1 730571 H1 


283 


531 


63 


231892.12.dec 


2844332H1 


670 


940 


63 


231 892.1 2.dec 


3585953H1 


285 


501 


63 


231892.12.dec 


4021 45H1 


690 


926 


63 


231 892.1 2.dec 


3336679H1 


286 


535 


63 


231892.12.dec 


375351 4H1 


314 


616 


63 


231 892.1 2.dec 


360241 9H1 


294 


555 


63 


231892.12.dec 


255761 6H1 


323 


595 


63 


231 892.1 2.dec 


205361 1H1 


313 


572 


63 


231 892.1 2.dec 


1341814H1 


323 


519 


63 


231 892.1 2.dec 


2050642H1 


1214 


1457 


63 


231892.12.dec 


731683R1 


323 


669 


63 


231 892.1 2.dec 


g4076833 


1214 


1583 


63 


231892.12.dec 


g708378 


326 


660 


63 


231 892.1 2.dec 


g3433968 


1214 


1588 


63 


231892.12.dec 


g21 15395 


335 


628 


63 


231 892.1 2.dec 


g1 920922 


1215 


1583 


63 


231 892.1 2.dec 


2545989H1 


348 


530 


63 


231 892.1 2.dec 


g4264197 


1216 


1583 


63 


231892.12.dec 


2403449H1 


348 


569 


63 


231 892.1 2.dec 


g5676296 


1217 


1588 


63 


231 892.1 2.dec 


2779994H1 


349 


600 


63 


231 892.1 2.dec 


g4243864 


1220 


1583 


63 


231 892.1 2.dec 


5285550H1 


349 


461 


63 


231 892.1 2.dec 


g3895704 


1219 


1583 


63 


231892.12.dec 


2771977H1 


353 


599 


63 


231 892.1 2.dec 


g5554320 


1220 


1535 


63 


231892.12.dec 


2286386H1 


359 


596 


63 


231 892.1 2.dec 


g3917870 


1225 


1584 


63 


231892.12.dec 


2058238H1 


363 


600 


63 


231 892.1 2.dec 


g3644268 


1226 


1583 


63 


231892.12.dec 


3189574H1 


364 


660 


63 


231 892.1 2.dec 


g4269820 


1227 


1583 


63 


231892.12.dec 


g 1727293 


371 


486 


63 


231 892.1 2.dec 


1375225T6 


1227 


1543 


63 


231892.12.dec 


g 1774709 


371 


558 


63 


231 892.1 2.dec 


g4084882 


1227 


1586 


63 


231892.12.dec 


2692980H1 


374 


636 


63 


231 892.1 2.dec 


347608H1 


1230 


1468 


63 


231892.12.dec 


g2100130 


1 


209 


63 


231 892.1 2.dec 


g4649732 


1232 


1590 


63 


231892.12.dec 


6301 221 H1 


1 


116 


63 


231 892.1 2.dec 


g3869737 


1235 


1583 


63 


231 892.1 2.dec 


3576472H1 


5 


293 


63 


231 892.1 2.dec 


1638737H1 


1237 


1411 


63 


231892.12.dec 


24031 66H1 


4 


114 


63 


231 892.1 2.dec 


g2754388 


1238 


1587 


63 


231 892.1 2.dec 


1870381 H1 


6 


141 


63 


231892.12.dec 


1 636071 H1 


1237 


1464 


63 


231 892.1 2.dec 


1804755H1 


10 


237 


63 


231 892.1 2.dec 


919225H1 


863 


1143 


63 


231892.12.dec 


660621 0H1 


1 


380 


63 


231 892.1 2.dec 


3571728H1 


868 


1139 


63 


231 892.1 2.dec 


g2353927 


1281 


1583 


63 


231892.12.dec 


g651120 


868 


1151 


63 


231892.12.dec 


g4687116 


1282 


1583 


63 


231892.12.dec 


3766854H1 


872 


1173 


63 


231 892.1 2.dec 


g4990332 


1283 


1583 


63 


231892.12.dec 


1 560301 H1 


885 


1089 


63 


231 892.1 2.dec 


g3277437 


1284 


1588 


63 


231892.1 2.dec 


86971 7H1 


887 


1140 


63 


231 892.1 2.dec 


g2725785 


1286 


1583 


63 


231 892.1 2.dec 


818102H1 


905 


1139 


63 


231 892.1 2.dec 


g4004820 


1286 


1588 


63 


231892.12.dec 


2018418H1 


911 


1167 


63 


231 892.1 2.dec 


g1 73631 6 


1288 


1592 


63 


231892.12.dec 


1651958H1 


912 


1145 


63 


231 892.1 2.dec 


g1 733403 


1287 


1591 


63 


231892.12.dec 


2937249H1 


912 


1199 


63 


231892.12.dec 


g4070065 


1288 


1583 


63 


231 892.1 2.dec 


2940340H1 


912 


1178 


63 


231 892.1 2.dec 


g2910317 


1280 


1376 


63 


231 892.1 2.dec 


2955448H1 


913 


1144 


63 


231892.12.dec 


g4194161 


1288 


1587 


63 


231 892.1 2.dec 


2955448F6 


913 


1343 


63 


231 892.1 2.dec 


g2321061 


1288 


1558 


63 


231 892.1 2.dec 


1475472H1 


918 


1098 


63 


231892. 12.dec 


g4308757 


1289 


1584 


63 


231 892.1 2.dec 


1751019H1 


921 


1124 


63 


231 892.1 2.dec 


g5233103 


1283 


1591 


63 


231 892.1 2.dec 


2568696H1 


926 


1145 


63 


231892.12.dec 


g2726125 


1289 


1583 


63 


231 892.1 2.dec 


2568515H1 


926 


1190 


63 


231892.12.dec 


1311118H1 


1406 


1583 


63 


231 892.1 2.dec 


1319388H1 


926 


1150 


63 


231892.12.dec 


2245159H1 


1406 


1583 


63 


231 892.1 2.dec 


4860284H1 


1067 


1336 


63 


231892.12.dec 


387055H1 


1407 


1580 


63 


231 892.1 2.dec 


40601 1H1 


1068 


1283 


63 


231892.12.dec 


g4285750 


1434 


1586 


63 


231892.12.dec 


641 1241 H1 


1068 


1340 


63 


231892.1 2.dec 


g4394615 


1440 


1583 


63 


231 892.1 2.dec 


54241 2H1 


1069 


1314 


63 


231892.12.dec 


g4150176 


1440 


1587 


63 


231892.12.dec 


817008H1 


1075 


1347 


63 


231892.12.dec 


g1271257 


1449 


1594 


63 


231892.12.dec 


2953359T6 


1078 


1576 


63 


231 892.1 2.dec 


983370H1 


1455 


1583 


63 


231 892.1 2.dec 


987383H1 


1086 


1395 


63 


231892.12.dec 


983370T1 


1455 


1543 


63 


231892.12.dec 


g2899974 


1090 


1575 


63 


231892.12.dec 


g3037378 


1461 


1584 


63 


231892.12.dec 


4467971 H1 


1093 


1324 
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63 


231892.12.dec 


54041 8H1 


1093 


1335 


63 


231 892.1 2.dec 


2617909H1 


142 


355 


63 


231892.12.dec 


6160336H1 


1096 


1367 


63 


231892.12.dec 


6385750H1 


145 


430 


63 


231892.1 2.dec 


g1 349663 


1098 


1579 


63 


231892.12.dec 


g2240872 


156 


399 


63 


231892.12.dec 


3391333H1 


1100 


1326 


63 


231892.12.dec 


31 80671 H1 


163 


436 


63 


231892.12.dec 


g1 289604 


1107 


1590 


63 


231 892.1 2.dec 


3255568H1 


169 


421 


63 


231892. 12.dec 


g5178166 


1102 


1584 


63 


231 892.1 2.dec 


1785058H1 


176 


268 


63 


231 892.1 2.dec 


1 886761 H1 


1105 


1369 


63 


231892.12.dec 


3455258H1 


182 


463 


63 


231 892.1 2.dec 


1 878271 H1 


1024 


1294 


63 


231 892.1 2.dec 


g3334898 


182 


1583 


63 


231892. 12.dec 


3188440H1 


1035 


1281 


63 


231892.12.dec 


284621 3H1 


187 


448 


63 


231 892.1 2.dec 


1736789H1 


1034 


1225 


63 


231892.12.dec 


1 532971 H1 


189 


398 


63 


231892.12.dec 


1557363H1 


1035 


1197 


63 


231892.12.dec 


g1 20361 9 


191 


520 


63 


231892.12.dec 


1850596T6 


1036 


1546 


63 


231892.12.dec 


g847234 


195 


428 


63 


231 892.1 2.dec 


1853066H1 


1035 


1264 


63 


231892.12.dec 


878859H1 


200 


452 


63 


231 892.1 2.dec 


63361 5H1 


1035 


1227 


63 


231892.12.dec 


g1 6971 21 


196 


562 


63 


231 892.1 2.dec 


1735065H1 


1035 


1218 


63 


231892.12.dec 


2291868H1 


200 


385 


63 


231892.12.dec 


2650227H1 


1035 


1229 


63 


231892. 12.dec 


2615766H1 


208 


484 


63 


231 892.1 2.dec 


2650258H1 


1035 


1240 


63 


231892.12.dec 


4440531 H1 


216 


410 


63 


231 892.1 2.dec 


4596520H1 


1035 


1261 


63 


231892.12.dec 


g1 733402 


224 


342 


63 


231 892.1 2.dec 


3540242H1 


1035 


1269 


63 


231892. 12.dec 


6169874H1 


229 


408 


63 


231 892.1 2.dec 


275931 OH 1 


1035 


1234 


63 


231 892.1 2.dec 


1 444781 H1 


230 


445 


63 


231892.12.dec 


5186822H1 


1039 


1203 


63 


231892.12.dec 


3563740H1 


238 


518 


63 


231 892.1 2.dec 


2989463T6 


1040 


1546 


63 


231892.12.dec 


5678361 H1 


239 


417 


63 


231892. 12.dec 


3801 691 H1 


1042 


1341 


63 


231 892.1 2.dec 


2484857H1 


241 


484 


63 


231892.12.dec 


3802491 H1 


1042 


1380 


63 


231 892.1 2.dec 


855867H1 


248 


469 


63 


231892.1 2.dec 


3089740H1 


1047 


1322 


63 


231892.12.dec 


5626936H1 


248 


450 


63 


231892.12.dec 


5340828H1 


1048 


1325 


63 


231 892.1 2.dec 


g1 3031 50 


248 


560 


63 


231892. 12.dec 


31 28579 H1 


1048 


1340 


63 


231 892.1 2.dec 


g41 75728 


1239 


1589 


63 


231 892.1 2.dec 


349802H1 


1051 


1317 


63 


231892.12.dec 


g2659270 


1238 


1376 


63 


231 892.1 2.dec 


1469950H1 


1055 


1155 


63 


231892.12.dec 


g41 13163 


1239 


1586 


63 


231 892.1 2.dec 


5605748H1 


1057 


1297 


63 


231 892.1 2.dec 


6296673H1 


1241 


1542 


63 


231 892.1 2.dec 


4244841 H1 


1057 


1310 


63 


231892.12.dec 


g41 94287 


1241 


1587 


63 


231 892.1 2.dec 


351340H1 


1060 


1179 


63 


231 892.1 2.dec 


g3399839 


1246 


1583 


63 


231892.12.dec 


1977654H1 


1059 


1338 


63 


231892.12.dec 


1958239H1 


1249 


1507 


63 


231 892.1 2.dec 


1418488H1 


1432 


1580 


63 


231 892.1 2.dec 


4671164H1 


1249 


1512 


63 


231892.12.dec 


3575324H1 


59 


350 


63 


231892.12.dec 


g3428911 


1249 


1405 


63 


231 892.1 2.dec 


343771 6H1 


57 


298 


63 


231892.12.dec 


1927463H1 


1249 


1368 


63 


231 892.1 2.dec 


2696837H1 


56 


354 


63 


231 892.1 2.dec 


g1 61 6285 


1250 


1488 


63 


231 892.1 2.dec 


1892118H1 


56 


289 


63 


231892.12.dec 


g3918480 


1251 


1587 


63 


231892.12.dec 


3347754H1 


54 


299 


63 


231892.12.dec 


g3279206 


1254 


1590 


63 


231 892.1 2.dec 


2264065H1 


55 


316 


63 


231892.12.dec 


4551689H1 


1255 


1474 


63 


231892.12.dec 


1478070H1 


54 


314 


63 


231 892.1 2.dec 


g2784478 


1254 


1376 


63 


231892.12.dec 


3082709H1 


56 


351 


63 


231892.12.dec 


2426256H1 


1255 


1493 


63 


231892.12.dec 


1341563H1 


56 


278 


63 


231892.12.dec 


g3694470 


1257 


1586 


63 


231892.12.dec 


1 855681 H1 


56 


326 


63 


231 892.1 2.dec 


g826517 


1264 


1592 


63 


231892.12.dec 


2526504H1 


56 


287 


63 


231 892.1 2.dec 


g3000871 


1264 


1582 


63 


231 892.1 2-dec 


5043567H2 


57 


316 


63 


231892.12.dec 


g4599945 


1264 


1582 


63 


231892.12.dec 


2463905H1 


57 


300 


63 


231 892.1 2.dec 


2827756H1 


1266 


1546 


63 


231 892.1 2.dec 


3077148H1 


57 


320 


63 


231892.12.dec 


5854044H1 


1266 


1528 


63 


231 892.1 2.dec 


3239038H1 


57 


299 


63 


231892.12.dec 


g4391003 


1267 


1583 


63 


231892.12.dec 


649721 8H1 


57 


502 


63 


231 892.1 2.dec 


1 364761 H1 


1269 


1516 


63 


231892.12.dec 


3122810H1 


54 


388 


63 


231892. 12.dec 


2017403H1 


1272 


1422 


63 


231892.12.dec 


2434043H1 


60 


282 


63 


231892.12.dec 


g4890638 


1272 


1587 


63 


231892.12.dec 


3078474H1 


57 


358 


63 


231892.12.dec 


g3883945 


1273 


1376 


63 


231892.12.dec 


1859775H1 


56 


115 


63 


231 892.1 2.dec 


3876771 H1 


1273 


1562 


63 


231892.12.dec 


2467634H1 


60 


288 


63 


231 892.1 2.dec 


g517972 


1281 


1592 


63 


231892.12.dec 


1393338H1 


59 


314 


63 


231892.12.dec 


g4628861 


1275 


1586 


63 


231892. 12.dec 


1711695H1 


59 


278 


63 


231892.12.dec 


g2788430 


1277 


1376 


63 


231892.12.dec 


2991815H1 


56 


322 


63 


231 892.1 2.dec 


g2062988 


1276 


1587 


63 


231 892.1 2.dec 


1317933H1 


57 


303 


63 


231892.12.dec 


6009646H1 


1280 


1553 


63 


231892.12.dec 


661199H1 


60 


320 


63 


231 892.1 2.dec 


g2768816 


1279 


1586 


63 


231892.12.dec 


1395795H1 


59 


268 


64 


197445.1. Oct 


3493649H1 


1829 


1955 


63 


231 892.1 2.dec 


2526346H1 


56 


303 


64 


1 97445.1 .oct 


g3870474 


1879 


2330 


63 


231 892.1 2.dec 


3245594H1 


61 


311 


64 


1 97445. 1.oct 


g3840082 


1879 


2324 


63 


231892.12.dec 


1959645H1 


57 


301 


64 


197445.1. Oct 


g43 11546 


1883 


2329 


63 


231892.12.dec 


3362542H1 


142 


360 


64 


1 97445.1 .oct 


g2210698 


1885 


2356 


63 


231 892.1 2.dec 


1 843501 H1 


142 


370 


64 


1 97445. 1.oct 


3254759H1 


1892 


2154 


63 


231 892.1 2.dec 


2700956H1 


142 


384 


64 


197445.1. oct 


4632923H1 


1331 


1601 
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64 


197445.1. oct 


3721127H1 


1342 


1645 


64 


197445.1. oct 


2216768H1 


1791 


1955 


64 


197445.1. oct 


26001 88H1 


1374 


1534 


64 


197445.1. oct 


1285672H1 


1806 


1944 


64 


1 97445. 1.oct 


776569H1 


1440 


1682 


64 


197445.1. oct 


2958707H1 


1517 


1755 


64 


197445.1. oct 


4213103H1 


1255 


1513 


64 


197445.1. oct 


g 1779556 


1554 


1964 


64 


197445.1. oct 


228661 8H1 


1268 


1500 


64 


197445.1. oct 


5579394H1 


1563 


1785 


64 


1 97445. 1.oct 


5743342H1 


1302 


1574 


64 


197445.1. oct 


463695T6 


1571 


1918 


64 


1 97445. 1.oct 


g4307960 


1 


483 


64 


197445.1 .oct 


1546362H1 


1603 


1805 


64 


1 97445. 1.oct 


g4243527 


1 


495 


64 


197445.1. oct 


5076525H1 


1225 


1498 


64 


1 97445. 1.oct 


g41 86422 


1 


486 


64 


197445.1. oct 


4163716H1 


1232 


1503 


64 


1 97445. 1.oct 


g4074327 


1 


507 


64 


197445.1. OCt 


g3034044 


1239 


1307 


64 


1 97445. 1.oct 


3810949H1 


1048 


1239 


64 


197445.1 .oct 


5075231 H1 


1252 


1427 


64 


197445.1. Oct 


1517215H1 


1613 


1789 


64 


197445.1 .oct 


4158314H1 


1645 


1888 


64 


1 97445. 1oct 


g4452855 


1640 


1955 


64 


197445.1 .OCt 


2201582H1 


1651 
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64 
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4158512H1 


1645 


1909 


64 
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g2583246 


1673 


1961 


64 
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4158413H1 


1645 


1867 


64 
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5897546H1 


1460 


1755 


64 
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1998 


2326 


64 
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g2539095 


1487 


1963 


64 


1 97445. 1.oct 
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202 


427 


64 
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1 83321 4H1 


1502 


1772 


64 


197445.1. Oct 


g3917499 


19 


484 


64 
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g3538527 


1512 


1959 


64 


197445.1. oct 


g 1069368 


64 


381 


64 


197445.1 .oct 


2440656H1 


1674 


1927 


64 


197445.1. OCt 


3383583H1 


100 


215 


64 


197445.1. oct 


1353593T6 


1677 


1916 


64 


1 97445. 1.oct 


1 59091 1H1 


104 


306 


64 


197445.1. oct 


g779497 


1679 


1864 


64 


197445.1. oct 


g1 81 0091 


155 


400 


64 


197445.1. OCt 


1353593H1 


1684 


1940 


64 


1 97445.1 .oct 


g1313174 


158 


552 


64 


197445.1. oct 


1353593F6 


1684 


1959 


64 


197445.1. oct 


709779H1 


192 


434 


64 


197445.1. oct 


1353593F1 


1684 


1959 


64 


197445.1. oct 


g1 383698 


218 


551 


64 


197445.1. oct 


4907774H1 


1710 


1967 


64 


197445.1. oct 


g3743490 


269 


455 


64 


197445.1 .oct 


26591 72H1 


915 


1155 


64 


197445.1. oct 


g4078298 


1 


455 


64 


197445.1. oct 


463695R6 


916 


1268 


64 


197445.1. oct 


g3888019 


1 


453 


64 


197445.1 .oct 


463695H1 


916 


1160 


64 


197445.1. oct 


1833139T6 


1808 


2281 


64 


197445.1. oct 


g2874288 


918 


1307 
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1287809F1 


1809 


2321 
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4215357H1 


919 


1207 
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3790190H1 


1910 


2195 


64 
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4841176H1 


1012 


1273 


64 
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g1 366863 


1932 


2349 


64 
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g2819160 


1034 


1238 
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g3837964 


1964 


2323 
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g383761 1 


1 


498 
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g22 11068 
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2356 
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g3834905 


1 


464 
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g3108713 


1981 


2340 
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g4085770 


1 


408 
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054382H1 
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2183 
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g3841825 


1 


454 
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1991 


2289 
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g3838085 


1 


463 
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1683894H1 


1998 


2221 
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g3917812 


1 


477 
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3152839H1 


552 


834 
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1833139H1 


754 


1009 
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5084942H1 


564 


814 
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g21 14631 


769 


1231 
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3627622H1 


579 


882 
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g3803455 


831 


1230 
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5657362H1 


635 


878 
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g1 35031 9 


882 


1447 
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g21 14928 


704 


1107 


64 
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g1954291 


889 


1209 


64 
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g21 67371 


706 


1212 


64 
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3334792H1 


898 


1059 


64 
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g2210757 


722 


1166 


64 


197445.1. oct 


3810949F6 


1046 


1481 


64 


197445.1. OCt 


g2210776 


722 


1167 
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2915878H1 


1075 


1260 
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754 


1217 
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1154 
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1514 


1761 
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1162 


1635 
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1514 


1762 


64 
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4750730H1 


1170 


1413 
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5289647H1 


2272 


2531 
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2101538H1 


1208 


1437 


64 


197445.1. oct 


g2575310 


2367 


2718 
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4334028H1 


1207 


1472 
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2216 


2279 
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285 


571 
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g1 209957 


2228 


2631 
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g1 382633 


328 


537 
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g2742820 


2263 


2326 
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g21 88701 


408 


799 
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g21 85069 


1998 


2329 
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368533H1 


453 


612 
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1998 


2211 
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193222H1 


467 


669 
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1458660H1 


2024 


2301 


64 


197445.1. oct 


5089936H1 


538 


711 


64 


197445.1. OCt 


2507791 H1 


2037 


2266 


64 


197445.1. oct 


g1 241 975 


548 


746 


64 


197445.1. oct 


2293052H1 


2048 


2314 


65 


348775.1. oct 


52731 23H1 


1 


260 


64 


197445.1. oct 


2450647H1 


2048 


2129 


65 


348775.1. oct 


g4457572 


1 


254 
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197445.1. oct 


1287809H1 


2080 


2321 


65 


348775.1 -oct 


4247889H1 


10 


269 
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197445.1. oct 


2767430H1 


2083 


2320 


65 


348775.1. oct 


4247889F6 


10 


350 
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1288563H1 


2085 


2321 


65 


348775.1. oct 


4314064F6 


18 


243 


64 


197445.1. oct 


g31 73735 


2106 


2320 


65 


348775.1. oct 


4314064H1 


18 


247 
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197445.1. oct 


g1614963 


2143 


2318 


65 


348775.1. oct 


4249504 R6 


34 


233 


64 


197445.1. oct 


5113750H1 


1715 


1985 


65 


348775.1 .oct 


2020732H1 


62 


170 


64 


197445.1. OCt 


g2825974 


1736 


1960 


65 


348775.1. oct 


144037H1 


72 


380 
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65 

w w 
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978302. 3.dec 


g1218452 


2418 


2554 


70 


228629.11. dec 


312757H1 


1437 


1655 


69 


978302.3.dec 


1649824H1 


2444 


2585 


70 


228629.11. dec 


312451 H1 


1438 


1604 


69 


978302.3.dec 


g 1957671 


2454 


2599 


70 


228629. 11. dec 


312476H1 


1438 


1604 


69 


978302.3-dec 


g3960735 


2464 


2552 


70 


228629.11. dec 


2639058H1 


1437 


1688 


69 


978302.3.dec 


42741 32H1 


2471 


2609 


70 


228629.11. dec 


2328889H1 


1523 


1816 


70 


228629.11. dec 


g2934289 


1704 


2059 


70 


228629.11. dec 


2751426H1 


1457 


1716 


70 


228629.11. dec 


g21 02954 


1705 


2089 


70 


228629.11. dec 


4344049H1 


1462 


1753 


70 


228629.11. dec 


g3 147325 


1707 


2091 


70 


228629.11. dec 


3771512H1 


1462 


1733 


70 


228629.11 dec 


001420H1 


1699 


2104 


70 


228629.11. dec 


2099424H1 


1473 


1739 


70 


228629. 11. dec 


1294966H1 


1119 


1358 


70 


228629.11. dec 


025702 H1 


1231 


1388 


70 


228629.11. dec 


6374472H1 

X^X^ W ■ ■ * * ■ ■ ■ ■ 


1119 


1359 


70 


228629 1 1 dec 

l»feWW»wV ■ ■ • * X-* X^X^ 


882702H1 


1372 


1626 


70 


228629.11. dec 


2743486H1 


1193 


1430 


70 


228629.11. dec 


g5365368 


1864 


2085 


70 


228629.11. dec 


4531151H1 


1122 


1400 


70 


228629.11. dec 


308831 9H1 


1868 


2085 


70 


228629 1 1 dec 


6359587H2 

x^x^x^x^x^x^ / » ■ 


1197 


1749 


70 


228629 1 1 dec 


02782978 


1882 


2085 


70 


228629.11. dec 


5734367H1 

™» ■ x^ ■ x^ x»* # ■ ■ ■ 


1175 


1374 


70 


228629 1 1 dec 

* "' * X^ X^^»X^ ■ • ■ * X^ V»» X' 


39321 84H1 


1452 


1729 


70 


228629.11. dec 


892335H1 

X^ x^ b* X^ X^ x^ ■ 1 9 


1758 


1948 


70 


228629.11 dec 

" ■ X^ X^ ■ »X^ ■ 9 ■ »X^X**X^ 


6587364H1 

X^X*X^ T %X X^ ■ ■ ■ 9 


1378 


1921 


70 


228629.11. dec 


0648436 

XJ x^ rw P/ X^Xrf^ 


1762 


2100 


70 


228629 1 1 dec 

# ■ ■ ■ X^ X^ ■ ^* ■ 1 ■ .\r* W 


1296630H1 

■ X^ X^ X^X^ X^ ■ • f 


1380 


1596 


70 


228629.11. dec 


975009H1 

X^ ¥ X^ X^ V* X^ • ■ ■ 


1776 


2069 


70 


228629.11. dec 


3126240H1 


1380 


1655 


70 


228629.11. dec 


Q3889950 

X^ w x^ x^ 


1810 


2086 


70 


228629. 11 -dec 


1979479H1 

■ X^ ■ X^ W 9 V * ~ ■ 


1383 


1654 


70 


228629.11. dec 


887059H1 


1839 


2085 


70 


228629.11. dec 


258521 7H1 


1324 


1571 


70 


228629.11. dec 


869989 H1 


1840 


2086 


70 


228629.11. dec 


g756027 


1283 


1509 


70 


228629.11. dec 


869854H1 


1840 


2085 


70 


228629.11. dec 


955073H1 


1361 


1648 


70 


228629.11. dec 


g 1626689 


1735 


2084 


70 


228629.11. dec 


2528386H1 


1536 


1858 


70 


228629.11. dec 


g3921132 


1743 


2093 


70 


228629.11. dec 


1225607H1 


1548 


1788 


70 


228629.11. dec 


856889H1 


1754 


1937 


70 


228629.11. dec 


301236H1 


1439 


1648 


70 


228629.11. dec 


3614651 H1 


1 


217 


70 


228629.11. dec 


231848H1 


1438 


1610 


70 


228629.11. dec 


2689769H1 


17 


269 


70 


228629.11. dec 


5185095H1 


1444 


1675 


70 


228629. 11. dec 


2689769F6 


17 


478 


70 


228629.11. dec 


3502894H1 


1447 


1731 


70 


228629.11. dec 


6112670H1 


27 


280 


70 


228629.11. dec 


1800594H1 


1518 


1729 


70 


228629.11. dec 


g603230 


50 


2085 


70 


228629.11. dec 


g1331191 


1522 


1956 


70 


228629.11. dec 


6534961 H1 


63 


490 


70 


228629.11. dec 


1800568H1 


1518 


1740 


70 


228629.11. dec 


410074H1 


129 


368 


70 


228629.11. dec 


476384H1 


1654 


1922 


70 


228629.11. dec 


3717155H1 


246 


474 


70 


228629.11. dec 


1752755H1 


1661 


1913 


70 


228629.1 1 .dec 


5301716H1 


322 


592 


70 


228629.11. dec 


1754354H1 


1661 


1907 


70 


228629.11. dec 


6290096H1 


577 


811 


70 


228629.11. dec 


g834745 


1663 


1935 


70 


228629.11. dec 


2911910H1 


590 


856 


70 


228629.11. dec 


q838552 


1663 


1890 


70 


228629.11. dec 


1300552H1 


607 


836 


70 


228629.11. dec 


g4686244 


1673 


2085 


70 


228629.11. dec 


1413704H1 


625 


902 


70 


228629.11. dec 


4730903H1 


1612 


1880 


70 


228629.11. dec 


3661737H1 


659 


934 


70 


228629.11. dec 


g41 88496 


1639 


2090 


70 


228629.11. dec 


4434566H1 


735 


990 


70 


228629.11. dec 


4633239H1 


1640 


1885 


70 


228629.11. dec 


1976292H1 


796 


1038 


70 


228629. 11. dec 


004785H1 


1581 


1877 


70 


228629. 11. dec 


6297547H1 


804 


1068 


70 


228629.11. dec 


4596660H1 


1583 


1841 


70 


228629.11. dec 


910262H1 


827 


1122 


70 


228629. 11. dec 


5951768H1 


1588 


1934 


70 


228629.11. dec 


1701222H1 


827 


1038 


70 


228629.11. dec 


5321992H1 


1598 


1863 


70 


228629.11. dec 


6131544H1 


850 


1153 


70 


228629.11. dec 


5004692H1 


1598 


1839 


70 


228629.11 -dec 


6361603H2 


891 


1430 


70 


228629.11. dec 


6103423H1 


1549 


1851 


70 


228629.11. dec 


1433660H1 


898 


1136 


70 


228629.11. dec 


221523H1 


1550 


1656 


70 


228629.11. dec 


1395790H1 


951 


1201 


70 


228629.11. dec 


3450038H1 


1555 


1798 


70 


228629.11. dec 


3253094H1 


959 


1157 


70 


228629. 11. dec 


541569H1 


1575 


1796 


70 


228629.11. dec 


4710208H1 


974 


1235 


70 


228629.11. dec 


816658H1 


1575 


1812 


70 


228629.11. dec 


6490580H1 


1001 


1377 


70 


228629.11. dec 


2104606H1 


1385 


1645 


70 


228629.11. dec 


6490280H1 


1001 


1460 


70 


228629.11. dec 


5927924H1 


1387 


1682 


70 


228629.11. dec 


4749725H1 


1006 


1274 


70 


228629.11. dec 


1496306H1 


1409 


1650 


70 


228629.11. dec 


1651252H1 


1006 


1249 


70 


228629.11. dec 


2770041 H1 


1690 


1943 


70 


228629.11. dec 


1 974061 H1 


1031 


1295 


70 


228629.11. dec 


1751734H1 


1698 


1922 


70 


228629.11. dec 


2412754H1 


1033 


1259 


70 


228629.11. dec 


3164903H1 


1698 


1985 


70 


228629.11. dec 


4438041 H1 


1052 


1290 


71 


011211.5.dec 


g3229245 


1492 


1675 


70 


228629.11. dec 


4437941 H1 


1052 


1293 


71 


011211.5.dec 


g3055421 


1589 


1677 


70 


228629.11. dec 


g715781 


1114 


1395 


71 


011211.5.dec 


g2355252 


1517 


1668 


70 


228629.11. dec 


g2141297 


1480 


1638 


71 


011211.5-dec 


g4307448 


1534 


1669 


70 


228629.11. dec 


3228693T6 


1484 


2060 


71 


011211.5.dec 


4422963H1 


252 


499 


70 


228629.11. dec 


207391 H1 


1929 


2085 


71 


011211.5.dec 


4422989H1 


252 


485 


70 


228629.11. dec 


211562H1 


1929 


2085 


71 


011211.5.dec 


389991 4H1 


266 


534 


70 


228629.11. dec 


3566051 H1 


1415 


1651 


71 


011211.5.dec 


1796354H1 


291 


508 


70 


228629.11. dec 


388535H1 


1421 


1701 


71 


011211.5.dec 


g1 329775 


351 


625 


70 


228629.11. dec 


505367H1 


1432 


1643 


71 


011211.5.dec 


3248726H1 


374 


682 


70 


228629.11. dec 


1358153H1 


1434 


1682 


71 


011211.5.dec 


458331 5H1 


395 


674 
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71 


01121 1.5.dec 


730632H1 


414 


637 


71 


011211.5.dec 


g278591 1 


1312 


1665 


71 


01121 1.5.dec 


790391 R1 


429 


1010 


71 


011211.5.dec 


2500452F6 


1 


490 


71 


01121 1.5.dec 


790391 H1 


429 


671 


71 


011211.5.dec 


2500452H1 


1 


267 


71 


01121 1.5.dec 


4518427H1 


442 


687 


71 


01121 1.5.dec 


2504526H1 


50 


289 


71 


011211.5.dec 


293581 6H1 


445 


695 


71 


011211.5.dec 


2485204H1 


130 


370 


71 


01121 1.5.dec 


2722677F6 


470 


965 


71 


011211.5.dec 


4590655H1 


205 


450 


71 


011211.5.dec 


1788251H1 


563 


812 


71 


011211.5.dec 


1 68631 9H1 


205 


396 


71 


011211.5-dec 


5116027H1 


567 


827 


71 


011211.5.dec 


3211206H1 


205 


327 


71 


011211.5.dec 


5116074H1 


567 


821 


71 


011211.5.dec 


3411833H1 


205 


437 


71 


011211.5.dec 


3986358H1 


596 


792 


71 


011211.5.dec 


3271955H1 


205 


464 


71 


011211.5.dec 


4203841 H1 


612 


909 


71 


011211.5.dec 


g2161312 


1211 


1669 


71 


011211.5.dec 


5314859H1 


675 


815 


71 


011211.5.dec 


g1 358211 


1227 


1675 


71 


011211.5-dec 


3740285H1 


1313 


1599 


71 


011211.5.dec 


g1 801 782 


1241 


1670 


71 


011211.5.dec 


5121062T6 


1314 


1660 


71 


011211.5-dec 


g41 53547 


1241 


1669 


71 


011211.5.dec 


g4664357 


1318 


1671 


71 


011211.5.dec 


g1317339 


1255 


1669 


71 


011211.5.dec 


g 1940230 


1324 


1650 


71 


011211.5.dec 


239541 0T6 


1256 


1632 


71 


011211.5.dec 


g2 184545 


1324 


1650 


71 


011211.5.dec 


1811608T6 


1262 


1629 


71 


011211.5.dec 


g3665509 


1326 


1676 


71 


011211.5.dec 


348838 1T6 


1262 


1627 


71 


011211.5.dec 


2722677T6 


1338 


1627 


71 


011211.5.dec 


g5674690 


1262 


1671 


71 


011211.5.dec 


g2952637 


1338 


1673 


71 


011211.5.dec 


536468H1 


1269 


1522 


71 


011211.5.dec 


2474881 H1 


173 


407 


71 


011211.5-dec 


730632F1 


1185 


1671 


71 


011211.5.dec 


3580753H1 


177 


489 


71 


011211.5.dec 


g5637214 


1207 


1671 


71 


011211.5.dec 


3581452H1 


177 


467 


71 


011211.5.dec 


1639808H1 


1208 


1418 


71 


01121 1.5.dec 


486671 H1 


187 


473 


71 


011211.5.dec 


g5659631 


1208 


1668 


71 


011211.5.dec 


25441 19H1 


194 


447 


71 


011211.5.dec 


2750833H1 


1052 


1190 


71 


011211.5.dec 


6370602H1 


204 


312 


71 


011211.5.dec 


2750833R6 


1052 


1313 


71 


011211.5.dec 


589841 9H1 


204 


489 


71 


011211.5.dec 


6176275H1 


1064 


1344 


71 


011211.5.dec 


3438381 H1 


204 


455 


71 


011211.5.dec 


628273H1 


1064 


1288 


71 


011211.5.dec 


5121062F6 


205 


670 


71 


01121 1.5.dec 


2525968H1 


1067 


1304 


71 


011211.5.dec 


3784294H1 


205 


502 


71 


011211.5-dec 


5954055H1 


1090 


1214 


71 


011211.5.dec 


2107539H1 


207 


458 


71 


011211.5-dec 


1534852T6 


1102 


1626 


71 


011211.5-dec 


3732863H1 


205 


469 


71 


011211.5.dec 


2354626H1 


1110 


1325 


71 


011211.5.dec 


1721414H1 


207 


420 


71 


011211.5.dec 


6603759H1 


1125 


1673 


71 


011211.5-dec 


g1 357922 


205 


599 


71 


011211.5.dec 


6603659H1 


1126 


1583 


71 


011211.5-dec 


2211136H1 


208 


407 


71 


011211.5.dec 


5206520T6 


1140 


1643 


71 


011211.5.dec 


2722677H1 


470 


713 


71 


011211.5.dec 


780467T6 


1157 


1624 


71 


011211.5.dec 


1811608F6 


478 


915 


71 


011211.5.dec 


483761 5H1 


1160 


1458 


71 


011211.5-dec 


1811608H1 


478 


721 


71 


011211.5.dec 


538428H1 


1164 


1311 


71 


01121 1.5.dec 


1809182H1 


504 


669 


71 


011211.5.dec 


6393337H1 


813 


1108 


71 


01121 1.5.dec 


1979530H1 


510 


755 


71 


011211.5.dec 


3438482H1 


851 


1098 


71 


011211.5.dec 


6557538H1 


516 


1108 


71 


01121 1.5.dec 


4185657H1 


871 


1086 


71 


011211.5.dec 


5119348H1 


522 


817 


71 


011211.5.dec 


6428245H1 


891 


1480 


71 


011211.5.dec 


5313378H1 


241 


379 


71 


011211.5.dec 


g21 88554 


894 


1044 


71 


011211.5.dec 


3342386H1 


243 


497 


71 


011211.5.dec 


1212647H1 


920 


1152 


71 


011211.5.dec 


4896741 H1 


252 


543 


71 


011211.5.dec 


2772091 F6 


929 


1255 


71 


01121 1.5.dec 


1514966H1 


229 


402 


71 


011211.5-dec 


2772091 H1 


929 


1176 


71 


011211.5.dec 


3982780H1 


234 


509 


71 


011211.5.dec 


3179605H1 


945 


1247 


71 


011211.5.dec 


4731415H1 


239 


512 


71 


011211.5.dec 


5816063H1 


946 


1181 


71 


011211.5-dec 


g2659808 


1475 


1671 


71 


011211.5.dec 


4691156H1 


954 


1182 


71 


011211.5.dec 


g4307315 


1488 


1669 


71 


011211.5-dec 


3411470H1 


956 


1210 


71 


011211.5-dec 


25621 77H1 


205 


462 


71 


011211.5.dec 


4278241 H1 


970 


1200 


71 


011211.5.dec 


1865272H1 


205 


449 


71 


011211.5.dec 


2920077H1 


1021 


1284 


71 


011211.5.dec 


1865272F6 


205 


602 


71 


011211.5-dec 


5119720H1 


1027 


1297 


71 


011211.5.dec 


3053770H1 


220 


511 


71 


01121 1.5.dec 


4939387H1 


1037 


1282 


71 


011211.5.dec 


30571 58H1 


220 


425 


71 


01121 1.5.dec 


2750833T6 


1041 


1626 


71 


011211.5.dec 


g1 31 5022 


224 


665 


71 


011211.5.dec 


2500452T6 


1045 


1623 


71 


011211.5-dec 


239541 0F6 


224 


572 


71 


011211.5.dec 


1904368H1 


1047 


1338 


71 


011211.5-dec 


239541 0H1 


224 


463 


71 


011211.5.dec 


1904368F6 


1047 


1344 


71 


011211.5.dec 


5374094H1 


224 


437 


71 


011211.5.dec 


g5540370 


1392 


1672 


71 


011211.5-dec 


g1 802011 


227 


661 


71 


011211.5.dec 


1865272T6 


1398 


1839 


71 


01121 1.5.dec 


g831726 


229 


623 


71 


011211.5-dec 


277209 1T6 


1403 


1625 


71 


011211.5-dec 


g5636952 


1271 


1671 


71 


011211.5.dec 


2158516H1 


1427 


1547 


71 


011211.5.dec 


g5530755 


1282 


1673 


71 


011211.5.dec 


g2321614 


1427 


1669 


71 


011211.5.dec 


g5659613 


1283 


1668 


71 


011211.5.dec 


g4565430 


1434 


1669 


71 


011211.5.dec 


2589181 H1 


1304 


1531 


71 


011211.5.dec 


790391 F1 


1443 


1669 


71 


011211.5.dec 


g3214484 


1309 


1674 


71 


011211.5.dec 


g2O21003 


1472 


1659 


71 


011211.5.dec 


g2354244 


1310 


1668 


71 


011211.5.dec 


4129420H1 


209 


384 
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71 


01121 1.5.dec 


4931939H1 


213 


502 


71 


011211.5.dec 


5299520H1 


212 


447 


71 


011211.5.dec 


7781 31 H1 


213 


424 


71 


011211.5.dec 


2995677H1 


212 


463 


71 


011211.5.dec 


3528587H1 


213 


504 


71 


011211.5.dec 


4192842H1 


213 


507 


71 


011211.5.dec 


2828929H1 


214 


474 


71 


01121 1.5.dec 


4689695H1 


218 


443 


71 


01121 1.5.dec 


5299661 H1 


219 


392 


71 


011211.5.dec 


3783536H1 


219 


526 


71 


011211.5.dec 


2432292H1 


219 


438 


71 


011211.5.dec 


2230623H1 


679 


906 


71 


011211.5.dec 


5578645H1 


734 


999 


71 


011211.5.dec 


4884278H1 


752 


1013 


71 


011211.5.dec 


3489306H1 


756 


1044 


71 


011211.5.dec 


48591 62H1 


760 


1036 


71 


011211.5.dec 


3451104H1 


765 


1019 


71 


011211.5.dec 


2605496H1 


766 


1012 


71 


011211.5.dec 


4199159H1 


782 


1056 


71 


01121 1.5.dec 


4858453H1 


783 


962 


71 


011211.5.dec 


g5634460 


792 


878 


71 


011211.5.dec 


1605934H1 


809 


1037 


71 


011211.5.dec 


5106375H1 


811 


1052 


71 


011211.5.dec 


1904368T6 


1371 


1629 


71 


011211.5.dec 


g2875270 


1371 


1595 


71 


011211.5.dec 


g2540410 


1373 


1672 


71 


011211.5.dec 


g831607 


1356 


1680 


71 


011211.5.dec 


6394443H1 


1348 


1612 


71 


01121 1.5.dec 


g21 84320 


1375 


1667 
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TABLE 5 

SEQ ID NO: Template ID Tissue Distribution 

1 4053 1 0. 1 .oct Cardiovascular System - 1 6%, Germ Cells - 1 2% 

2 480731 .6.oct Liver - 25%, Connective Tissue - 1 7%, Male Genitalia - 1 5% 

3 334751 .2.dec Germ Cells - 33%, Uver - 24%, Endocrine System - 1 3% 

4 237330.8.dec Respiratory System - 60%, Male Genitalia - 40% 

5 053778. 1 1 .dec Embryonic Structures - 67%, Nervous System - 1 7%, Digestive System - 

17% 

6 360645. 1 0.dec Embryonic Structures - 67%, Nervous System - 1 1 % 

8 997089.7.dec Sense Organs - 1 1 % 

9 237152.1 .dec Unclassified/Mixed - 41%, Germ Cells - 19%, Embryonic Structures - 

13%, Respiratory System - 13% 

10 232851 .7.dec Hemic and Immune System - 100% 

1 1 083804. 1 .dec Germ Cells - 70%, Connective Tissue - 1 6% 

1 2 27272 1 .6.oct Digestive System - 1 0% 

13 46 1 603.4.oct Germ Cells - 1 7%, Sense Organs - 1 4%, Musculoskeletal System - 1 0% 

14 332465.2.dec Connective Tissue - 100% 

1 5 4451 75.3.dec Germ Cells - 58%, Embryonic Structures - 20%, Male Genitalia - 1 7% 

1 6 980541 . 1 .dec Nervous System - 50%, Embryonic Structures - 43% 

1 7 237996. 1 .dec Cardiovascular System - 21%, Exocrine Glands - 21%, Respiratory 

System - 16%, Digestive System - 16%, Hemic and Immune System - 
16% 

1 9 242082. 1 0.dec Female Genitalia - 75%, Hemic and Immune System - 25% 

20 01 9239. 1 .dec Exocrine Glands - 26%, Endocrine System - 1 7%, Hemic and Immune 

System - 1 7% 

2 1 899943. 1 .dec Sense Organs - 1 4%, Musculoskeletal System - 1 2% 

22 443551 . 1 .dec Unclassified/Mixed - 50%, Nervous System - 27%, Female Genitalia - 

14% 

23 897957. 1 .dec Pancreas - 44%, Uver - 27%, Male Genitalia - 1 0% 

24 90091 1 . 1 .dec Germ Cells - 69%, Nervous System - 1 2% 

25 999296. 1 .dec Exocrine Glands - 22%, Endocrine System - 16%, Female Genitalia - 

16% 

26 442286. 1 .dec Respiratory System - 50%, Exocrine Glands - 33%, Hemic and Immune 

System -17% 

27 901 978. 1 .dec Musculoskeletal System - 25%, Unclassified/Mixed - 1 8%, Hemic and 

Immune System - 1 3% 

28 479346. 1 .dec Unclassified/Mixed - 27%, Embryonic Structures - 20%, Skin - 1 4% 

29 48 1 750. 1 .dec Male Genitalia - 29%, Urinary Tract - 1 3%, Skin - 1 1 % 

30 90091 7.2.dec Respiratory System - 1 00% 

3 1 9994 1 5. 1 .dec Connective Tissue - 32%, Cardiovascular System - 32%, Exocrine 

Glands- 16% 

32 900680.2.dec Embryonic Structures - 34%, Liver - 1 9%, Unclassified/Mixed - 1 6% 

33 902791 .3.dec Urinary Tract - 72%, Nervous System - 1 7%, Hemic and Immune System ■ 

11% 

34 053826. 1 .dec Germ Cells - 69%, Unclassified/Mixed - 22% 

35 204932.4.dec NO DATA 

36 400607. 1 9.dec Musculoskeletal System - 50%, Cardiovascular System - 29%, Nervous 

System -21% 

37 444248, 7.dec Exocrine Glands - 57%, Digestive System - 43% 

38 346599.9.dec Uver - 30%, Pancreas - 26%, Respiratory System - 21 % 

40 411396.24.dec Male Genitalia - 100% 

41 302819.4.dec Exocrine Glands - 100% 
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TABLE 5 

SEQ ID NO: Template ID Tissue Distribution 

42 238734.2.dec Skin - 94% 

43 399525.3.dec Unclassified/Mixed - 34%, Connective Tissue - 25%, Exocrine Glands - 

13% 

44 222795.6.dec widely distributed 

45 410628.5.dec Sense Organs - 32%, Nervous System - 12%, Urinary Tract - 1 1%, 

Connective Tissue - 1 1% 

46 053649.6.dec Skin - 89%, Male Genitalia - 1 1 % 

47 221914.2.dec Connective Tissue - 18%, Skin - 18%, Exocrine Glands - 13% 

49 401 482.2.oct Skin - 1 2%, Connective Tissue - 1 0% 

50 274551 . 1 .oct Nervous System - 60%, Hemic and Immune System - 40% 

51 411 408.20.dec Connective Tissue - 1 2%, Cardiovascular System - 1 1 % 

52 035973. 1 .dec Embryonic Structures - 67%, Nervous System - 1 7%, Digestive System - 

17% 

53 456536. 1 .dec widely distributed 

54 387807.4.oct Sense Organs - 33%, Skin -11%, Male Genitalia - 1 0% 

55 406790.3.dec Unclassified/Mixed - 32%, Urinary Tract - 25%, Musculoskeletal System 

10% 

56 412420.63.dec Sense Organs - 40% 

57 196623.3.dec widely distributed 

58 42791 6.8.dec Nervous System - 1 00% 

59 264633.8.dec Embryonic Structures - 86%, Hemic and Immune System - 14% 

61 902943. 1 .dec Unclassified/Mixed - 38%, Female Genitalia - 1 9%, Respiratory System 

10% 

64 1 97445. 1 .oct Unclassified/Mixed - 28% 

65 348775. 1 .oct Skin - 28%, Connective Tissue - 1 5%, Nervous System - 1 5% 

66 336239.5.dec Hemic and Immune System - 100% 

67 2 1 5660.4.dec Nervous System - 1 00% 

68 391940.2.dec Hemic and Immune System - 58%, Male Genitalia - 26%, Respiratory 

System- 16% 

69 978302.3.dec Sense Organs - 1 6%, Germ Cells -11%, Embryonic Structures - 1 1 % 

70 228629. 1 1 .dec Digestive System - 60%, Hemic and Immune System - 40% 

71 01 121 1 .5.dec Musculoskeletal System - 32%, Endocrine System - 27%, Nervous 

System - 18% 
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CLAIMS 

What is claimed is: 

1. An isolated polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71, 

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 1-71, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a) through d). 

2. An isolated polynucleotide of claim 1 , comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO: 1-71 . 

3. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a polynucleotide 
of claim 1. 

4. A composition for the detection of expression of diagnostic and therapeutic polynucleotides 
comprising at least one of the polynucleotides of claim 1 and a detectable label. 

5. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

6. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
comprising a sequence of a polynucleotide of claim 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
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specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization complex 
is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if present, 
the amount thereof. 

7. A method of claim 5, wherein the probe comprises at least 30 contiguous nucleotides. 

8. A method of claim 5, wherein the probe comprises at least 60 contiguous nucleotides. 

9. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 1 . 

10. A cell transformed with a recombinant polynucleotide of claim 9. 

11. A transgenic organism comprising a recombinant polynucleotide of claim 9. 

12. A method for producing a diagnostic and therapeutic polypeptide, the method comprising: 

a) culturing a cell unto conditions suitable for expression of the diagnostic and therapeutic 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide of claim 9, and; 

b) recovering the diagnostic and therapeutic polypeptide so expressed. 

13. A purified diagnostic and therapeutic polypeptide encoded by at least one of the 
polynucleotides of claim 2. 

14. An isolated antibody which specifically binds to a diagnostic and therapeutic polypeptide 
of claim 13. 

15. A method of identifying a test compound which specifically binds to the diagnostic and 
therapeutic polypeptide of claim 13, the method comprising the steps of: 

a) providing a test compound; 

b) combining the diagnostic and therapeutic polypeptide with the test compound for a 
sufficient time and under suitable conditions for binding; and 
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c) detecting binding of the diagnostic and therapeutic polypeptide to the test 
compound, thereby identifying the test compound which specifically binds the diagnostic and 
therapeutic polypeptide. 

16. A microarray wherein at least one element of the microarray is a polynucleotide of claim 3. 

17. A method for generating a transcript image of a sample which contains polynucleotides, 
the method comprising the steps of: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 16 with the labeled polynucleotides of the 
sample under conditions suitable for the formation of a hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

1 8. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a polynucleotide sequence of claim 1, the 
method comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying amounts of 
the compound and in the absence of the compound. 

19. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of claim 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 1 
or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the amount 
of hybridization complex in the treated biological sample is indicative of toxicity of the test compound. 
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(57) Abstract: The present invention provides purified human polynucleotides for diagnostics and therapeutics (dithp). Also en- 
compassed are the polypeptides (DITHP) encoded by dithp. The invention also provides for the use of dithp, or complements, 
oligonucleotides, or fragments thereof in diagnostic assays. The invention further provides for vectors and host cells containing 
dithp for the expression of DITHP. The invention additionally provides for the use of isolated and purified DITHP to induce antibod- 
ies and to screen libraries of compounds and the use of anti-DITHP antibodies in diagnostic assays. Also provided are microarrays 
containing dithp and methods of use. 
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<170> PERL Program 
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<211> 3211 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 405310. l.oct 



<220> 

<221> unsure 

<222> 91, 105, 119, 122, 124, 131, 139, 143, 211, 262, 288, 318, 342, 347, 

399, 414, 416, 421 

<223> a, t, c, g, or other 
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aaattaagat taattttcac taaccagctg aatactattg agtactatca agtgttgcaa 60 
caaatcttac cttctaataa aggtggctca ncctcaaagt tgttnccata gaaaggctna 120 
gntnaagctg nagtatatnc ctnagttggc tggtaaatct gcccggtgta tggctgttgt 180 
ggctgcatca tgtctggagg gacaaatctg ncttgctcga atagtcatag ccagcatact 240 
gttatagggt cctccacttc cnccataatc ataggactgc tgtgactnat catcgatgct 300 
gtaacttgtc tggtaganat ccgtgtttaa gttttcaaag cntgacnttg caaataatca 3 60 
aactctgagg ctgcgctgac ggcaaaggca gtagcttcnc taatcccata caancnccaa 420 
ncgctggttc agggtctgaa acagagtttg ggggttgttt gggattagtg aagctactgc 480 
ctttgccgcc agcgcagcct cagagtttga ttatttgcaa tgtcaggctt tgaaaactta 540 
aacacggatt tctaccagac aagttacagc atcgatgatc agtcacagca gtcctatgat 600 
tatggaggaa gtggaggacc ctatagcaaa cagtatgctg gctatgacta ttcgcagcaa 660 
ggcagatttg tccctccaga catgatgcag ccacaacagc catacaccgg gcagatttac 720 
cagccaactc aggcatatac tccagcttca cctcagcctt tctatggaaa caactttgag 780 
gatgagccac ctttattaga agagttaggt atcaattttg accacatctg gcaaaaaaca 840 
ctaacagtat tacatccgtt aaaagtagca gatggcagca tcatgaatga aactgatttg 900 
gcaggtccaa tggttttttg ccttgctttt ggagccacat tgctactggc tggcaaaatc 960 
cagtttggct atgtatacgg gatcagtgca attggatgtc taggaatgtt ttgtttatta 1020 
aacttaatga gtatgacagg tgtttcattt ggttgtgtgg caagtgtcct tgggatattg 1080 
tcttctgccc atgatcctac tttccagctt tgcagtgata ttttctttgc aaggaatggt 1140 
aggaatcatt ctcactgctg ggattattgg atggtgtagt ttttctgctt ccaaaatatt 1200 
tatttctgca ttagccatgg aaggacagca acttttagta gcatatcctt gcgctttgtt 12 60 
atatggagtc tttgccctga tttccgtctt ttgaaaattt atctgggatg tggacatcag 13 2 0 
tgggccagat gtacaaaaag gaccttgaac tcttaaattg gaccagcaaa ctgctgcagc 13 80 
gcaactctca tgcagattta catttgactg ttggagcaat gaaagtaaac gtgtatctct 1440 
tgttcatttt tatagaactt ttgcatacta tattggattt acctgcggtg tgactagctt 1500 
taaatgtttg tgtttataca gataagaaat gctatttctt tctggttcct gcagccattg 1560 
aaaaaccttt ttccttgcaa attataatgt ttttgataga tttttatcaa ctgtgggaaa 1620 
ccaaacacaa agctgataac ctttcttaaa aacgacccag tcacagtaaa gaagacacaa 1680 
gacttactgc aaaaatattt ttccaaggat ttaggaaaga aaaattgcct tgtattctca 1740 
agtcaggtaa ctcaaagcaa aaaagtgatc caaatgtaga gtatgagttt gcactccaaa 1800 
aatttgacat tactgtaaat tatctcatgg aatttttgct aaaattcaga gatacgggaa 1860 
gttcacaatc taccttattg tagacatgaa atgcgaacac ttacttacat attaatgtta 192 0 
actcaacctt agggacctgg aatggttgca ttaatgctat aatcgttgga tcgccacatt 1980 
tcccaaaaat aataaaaaaa tcactaacct tttttaagga aaatatttaa agttttacaa 2040 
aattcaatat tgcaattatc aatgtaaagt acatttgaat gcttattaaa actttcccaa 2100 
ttaattttaa ctgtgttatt gaatttactt ttactaaact actgttctct ttgtctcttt 2160 
tttaactagg ctctgatttt gacccctaat ttaagcttta agaatagaaa tcagctaata 222 0 
tagaatcaga caaaaagggg attaaatgag cagtttgagt tacattattt tattgtatta 2280 
aatttatttg atttatattg tcatgttctc ttgccagaga gaatctgtag gaaaatactg 2340 
tatcttgtat actgatcatt ggctttttct agaaaaactg tctctgattc tggacaaagc 2400 
tcagttatag ttacgagaaa gatatggtac agggaggaaa atactgcctt tttttttttt 24 60 
ttaaagagat tttcagacta aatagaaatg tcaaaatgat gtatcaatgg ttctttttta 2520 
gaacaagttt tcaaagcata aaaagaggtt gagagaaata acatatttat tgattcacat 2580 
aagtatgttt ttcttcatta atcgtctgga gaaacccact tgtcattaat ttgttttggg 2640 
ctaggttttc aaacttacca aattgcttta aaaaagcaat ttggaaggta atttgatagg 2700 
ctttccaact taaccaaatt ttttattgta attcttggat agtatttttg tctttttcaa 2760 
ttcatttgtc tttttcagta tagtttttgt taaggcaaat gtcttccctt aatatccaaa 2820 
tattgctaat aaacggtaga agatgctttg gaaattaaaa ttatctcgct gttggttaga 2880 
cttaacactg ttaatcttca gccaaatatc acatatggat caaattattt tcttttttgt 2940 
tgtttaccct atcctcaaca acatttttag tttaaattat tgtagagatt ttttttgtgg 3000 
tggttatttt ttattttgct ccaaaataat aaggtgcaaa gctattttat gcttaactgt 3060 
tgctctgtca aaacagctat gcagtggagt tgcatttgat gttctagagt ttgattacat 3120 
gcagagttgt atatagccaa aacttctctt atcaaactct gttatgtagg catatttata 3180 
tatacattaa agactgttgt actgtgtctc a 3211 

<210> 2 

<211> 742 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 480731. 6. oct 

<400> 2 

agcttctgta ctgccaggtc cgggtcggcg gctgcactgc ggatgagacc ggtgcgactc 60 
atgaaggtgt tcgtcacccg caggataccc gccgagggta gggtcgcgct cgcccgggcg 120 
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gcagactgtg aggtggagca gtgggactcg gatgagccca tccctgccaa ggagctagag 180 

cgaggtgtgg cgggggccca cggcctgctc tgcctcctct ccgaccacgt ggacaagagg 240 

atcctggatg ctgcaggggc caatctcaaa gtcatcagca ccatgtctgt gggcatcgac 300 

cacttggctt tggatgaaat caagaagcgt gggatccgag ttggctacac cccagatgtc 3 60 

ctgacagata ccaccgccga actcgcagtc tccctgctac ttaccacctg ccgccggttg 420 

ccggaggcca tcgaggaagt gaagaatggt ggctggacct cgtggaagcc cctctggctg 480 

tgtggctatg gactcacgca gagcactgtc ggcatcatcg ggctggggcg cataggtgag 540 

gctcccaccg gcccgcttgc ccgccccggc tctcacagcg tggtttgcat ccctggcacc 600 

acgtgtctga aggctgagaa gacccacatg ctgtcagggc actttgcttg cagtagagat 660 

atctctaatg agggatacag ctttgtaaaa cacaggcaaa tacataaata aaccaaagca 720 

gccttcctta agaactcaaa aa 742 

<210> 3 
<211> 1779 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 334751. 2. dec 
<400> 3 

gtgcgctcgc cgtcggctct acctgcgtgc tttagctcct tctcgcctga tccttctgtc 60 

tctcccaacc ccggacaccc ggcttcgact ggttatatct tcggtgttct tttcctctct 120 

tcttctttcg cggttcagca tgcaggaaaa agacgcctcc tcacaaggtt tcctgccaca 180 

cttccaacat ttcgccacgc aggcgatcca tgtgggccag gatccagagc aatggacctc 240 

cagggctgta gtgcccccca tctcactgtc caccacgttc aagcaagggg cgcctggcca 3 00 

gcactcgggt tttgaatata gccgttctgg aaatcccact aggaattgcc ttgaaaaagc 3 60 

agtggcagca ctggatgggg ctaagtactg tttggccttt gcttcaggtt tagcagccac 42 0 

tgtaactatt acccatcttt taaaagcagg agaccaaatt atttgtatgg atgatgtgta 480 

tggaggtaca aacaggtact tcaggcaagt ggcatctgaa tttggattaa agatttcttt 540 

tgttgattgt tccaaaatca aattactaga ggcagcaatt acaccagaaa ccaagcttgt 600 

ttggatcgaa acccccacaa accccaccca gaaggtgatt gacattgaag gctgtgcaca 660 

tattgtccat aagcatggag acattatttt ggtcgtggat aacactttta tgtcaccata 720 

tttccagcgc cctttggctc tgggagctga tatttctatg tattctgcaa caaaatacat 780 

gaatggccac agtgatgttg taatgggcct ggtgtctgtt aattgtgaaa gccttcataa 840 

tagacttcgt ttcttgcaaa actctcttgg agcagttcca tctcctattg attgttacct 900 

ctgcaatcga ggtctgaaga ctctacatgt ccgaatggaa aagcatttca aaaacggaat 960 

ggcagttgcc cagttcctgg aatctaatcc ttgggtagaa aaggttattt atcctgggct 1020 

gccctctcat ccacagcatg agttggtgaa gcgtcagtgt acaggttgta cagggatggt 1080 

caccttttat attaagggca cttcttcagc atgctgagat tttcctcaag aacctaaagc 1140 

tatttactct ggccgagagc ttgggaggat tcgaaagcct tgctgagctt ccggcaatca 1200 

tgactcatgc atcagttctt aagaatgaca gagatgtcct tggaattagt gacacactga 1260 

ttcgactttc tgtgggctta gaggatgagg aagacctact ggaagatcta gatcaagctt 1320 

tgaaggcagc acaccctcca agtggaagtc acagctagta ttccagagct gctattagaa 13 80 

gctgcttcct gtgaagatca aatcttcctg agtaattaaa tggaccaaca atgagccttt 1440 

gcaaaatttt caagcggaaa ttttaaggca cctcattatc tttcataact gtaattttct 1500 

tagggatcat ctctgttaaa aagttttctg tatgtcatgt tataattaca ggtcaattct 1560 

gttaatatct ttttgttaat tttgctctat gtttgcctct gaaggaggtg agatttgtgc 1620 

tactttggga gattatgttc ttttttcatg tctaagattt attttgatca tgtttataat 1680 

ataatggtaa ttcatttttg atgttttgtg aagaatttaa atttaaacga atgttcttaa 1740 

atcaagtgtg atttttttgc atatcattga aaagaacat 177 9 

<210> 4 
<211> 1305 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 237330. 8. dec 

<400> 4 

ggaaatggag aaacgctgaa aacgcacaac actttggttg gttaaattag ggttccgcca 60 
agagccggag cggtgtgcca agtgaaaact acatttccca cggggcagcg ggtcacgtca 120 
cagaggaacg actacgcatg cgtgcaaggt cctccgcgcg cgactacgct cataaaagga 180 

aaaaaagcgt gtgcggttct cgacgtgccg ccaatcttcg aacgcaggtc tgtgatgcat 240 
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ccgcagactc cgaaaaaggg ttcgaggaac gcgcctgctc ccctcgtcgc agtttccagc 3 00 
ccgacgagct tgttttgtcc cggactcggt gcccctgtag acaatggccc tcgtgtctgc 3 60 
cgattcccgc attgcagaac ttctcacaga gctccatcag ctgatcaaac aaacccagga 42 0 
agagcgttcg cggagcgaac acaacttagt gaacatccag aagacccatg agcggatgca 480 
gacagagaac aagatttctc cctattaccg gacaaagctg cgtggcctct acacaaccgc 54 0 
caaggccgat gcagaggctg agtgcaacat ccttcggaaa gctctggaca agatcgcgga 600 
aatcaagtct ctgttggaag agaggcggat tgcggccaag attgccggtc tctacaatga 660 
ctcggagcca ccccggaaga ccatgcgcag aggggtgctg atgaccctgc tgcagcagtc 720 
ggccatgacc ctgcccctgt ggatcgggaa gcctggtgac aagcccccac ccctctgtgg 780 
ggccatccct gcctcaggag actacgtggc cagacctgga gacaaggtgg ctgcccgggt 840 
gaaggccgtg gatggggacg agcagtggat cctggccgag gtggtcagtt acagccatgc 900 
caccaacaag tatgaggtag atgacatcga tgaagaaggc aaagagagac acaccctgag 96 0 
ccggcgccgt gtcatcccgc tgccccagtg gaaggccaac ccggagacgg accctgaggc 1020 
cttgttccag aaggagcagc tcgtgctggc cctgtatccc cagactacct gcttctaccg 1080 
cgccctgatc catgcgcccc cacagcggcc ccaggatgac tactcggtcc tgtttgaaga 1140 
cacctcctat gcagatggct attcccctcc cctcaatgtg gctcagagat acgtggtggc 1200 
ttgtaaggaa cccaagaaaa agtgatgccg cctggcagac tcgccatccc ccaacgacac 12 60 
agggcaggac agcagaggac gtgctgggat taaacacatt ccccc 13 05 

<210> 5 

<211> 1991 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 053778 . 11 . dec 
<220> 

<221> unsure 

<222> 83-84, 87, 97, 129, 131 
<223> a, t, c, g, or other 

<400> 5 

cgcaccgcag aggagaggct tctccaaccc ggcccggccc ttccttcccc tttcccgcag 60 
tcgttgcctc ctcctcccct ctnntcntcc tcccctncct cctcctggcc gcttagtctc 120 
acacccgcng ngccgttgtt cccgagacgt tgttgagtcc cctgtgtcct cttctgggtg 180 
gaggaactgc aatgtctggt ggagaacaga aaccagagag gtactatgtg ggtgtggacg 240 
ttggaacagg cagtgtccgt gcagctctgg tggaccagag tggggtcctg ttggcttttg 300 
cagaccagcc aattaagaat tgggagcccc agttcaacca ccatgagcag tcctccgagg 3 60 
acatctgggc tgcgtgctgt gttgtcacaa agaaagttgt acaagggatt gatttaaacc 420 
aaattcgagg acttgggttt gatgccacgt gttctctggt tgttttggat aagcagtttc 480 
acccattacc agtcaaccag taaggggatt cccatcgaaa cgtcatcatg tggctggacc 540 
atcgagcagt cagtcaagtt aacaggatca atgagaccaa gcacagtgtc ctccagtacg 600 
tcgggggggt gatgtctgtg gaaatgcagg ccccgaaact tctgtggctg aaagagaact 660 
tgagagagat ttgctgggat aaggcgggac atttctttga tctcccggac ttcttatcgt 720 
ggaaggcaac aggtgtcaca gcacggtctc tctgctccct ggtgtgtaag tggacatatt 780 
cagcagagaa aggctgggac gacagtttct ggaaaatgat tggtttggaa gactttgttg 840 
cagataatta cagcaaaata ggaaaccaag tgctacctcc tggagcttct cttggaaatg 900 
ggctcacacc agaggcagca agagaccttg gccttctccc tgggattgcg gtcgcagctt 960 
cactcattga tgcccatgca ggaggactag gagtgattgg ggcagatgtg agagggcacg 1020 
gcctcatctg tgaggggcag ccagtgacgt cacggctggc tgtcatctgt ggaacgtctt 1080 
cttgtcacat ggggatcagc aaagacccga tttttgtacc aggcgtctgg gggccttatt 1140 
tctcagccat ggtacctggg ttctggctga atgaaggtgg tcagagcgtt actggaaaat 1200 
tgatagacca catggtacaa ggccatgctg cttttccaga actacaagta aaggccacag 1260 
ccagatgcca gagtatatat gcatatttga acagtcacct ggatctgatt aagaaggctc 1320 
agcctgtggg tttccttact gttgatttac atgtttggcc agatttccat ggcaaccggt 13 80 
ctcccttagc agate tgaca ctaaagggca tggtcacegg attgaaactg tctcaggacc 1440 
ttgatgatct tgccattctc tacctggcca cagttcaagc cattgetttg gggactcget 1500 
tcattataga agecatggag gcagcagggc actcaatcag tactcttttc ctatgtggag 1560 
gcctcagcaa gaatcccctt tttgtgcaaa tgcatgcgga cattactggc atgcctgtgg 1620 
tcctgtcgca agaggtggag tccgttcttg tgggtgctgc tgttctgggt gcctgtgcct 1680 
caggggattt cgcttctgta caggaagcaa tggcaaaaat gagcaaagtt gggaaagttg 1740 
tgttcccgag actacaggat aaaaaatact atgataagaa ataccaagta ttcctgaagc 1800 
tggttgaaca ccagaaggag tatttggega tcatgaatga tgactgaaca gggcttgcag 1860 
gtgctgatgc cagaagcttc tgtgccattg cattaaagac ttctgtcatt tgatccatgt 1920 
tcaagaccct tgaggtattg tttcatcatt tctgtattgt ctttcaataa agaaaacaaa 1980 
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catgtgcaac c 1991 

<210> 6 

<211> 2055 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No; 360645 . 10 . dec 
<400> 6 

cggaggcgag cggagggttt cccgcggcgg atttctgaca gtcagacttg tccacaagaa 60 
ctcaactggc aaggctgctt ttctgtgcta aaactgggga gctagtgggc accatgaaga 12 0 
tcttctgcag tcgggccaat ccgaccacgg ggtctgtgga gtggctggag gaggatgaac 180 
actatgatta ccaccaggag attgcaaggt catcttatgc agatatgcta catgacaaag 240 
acagaaatgt aaaatactac caaggtatcc gggctgccgt gagcagggtg aaggacagag 3 00 
gacagaaggc cttggttctc gacattggca ctggcacggg actcttgtca atgatggcgg 3 60 
tcacagcagg tgccgacttc tgctatgcca tcgaggtttt caagcctatg gctgatgctg 420 
ctgtgaagat tgtggagaaa aatggcttta gtgataagat taaggttatc aacaagcatt 480 
ccaccgaggt gactgtaggt ccagagggtg acatgccatg ccgtgccaac atcctggtca 540 
cagagttgtt tgacacagag ctgatcgggg agggggcgct gccctcctat gagcacgcac 60 0 
acaggcatct cgtggaggaa aattgtgagg ccgtgcccca cagagccacc gtctatgcac 660 
agctggtgga gtccgggagg atgtggtcgt ggaacaagct atttcccatc cacgtgcaga 72 0 
ccagcctcgg agagcaggtc atcgtccctc ccgttgacgt ggagagctgc cctggcgcac 780 
cctctgtctg tgacattcag ctgaaccagg tgtcaccagc cgactttaca gtcctcagcg 840 
atgtgctgcc catgttcagc atagacttca gcaagcaagt cagtagctca gcagcctgcc 900 
atagcaggcg gtttgaacct ctgacatctg gccgagctca ggtggttctc tcgtggtggg 960 
acattgaaat ggaccctgag gggaagatca agtgcaccat ggcccccttc tgggcacact 1020 
cagacccaga ggagatgcag tggcgggacc actggatgca gtgtgtgtac ttcctgccac 1080 
aagaggagcc tgtggtgcag ggctcagcgc tctatctggt agcccaccac gatgactact 1140 
gcgtatggta cagcctgcag aggaccagcc ctgaaaagaa tgagagagtc cgccagatgc 1200 
gccccgtgtg tgactgccag gctcacctgc tctggaaccg gcctcggttt ggagagatca 12 60 
atgaccagga cagaactgat cgatacgtcc aggctctgag gaccgtgctg aagccagaca 1320 
gcgtgtgcct gtgtgtcagc gatggcagcc tgctctccgt gctggcccat cacctggggg 1380 
tggagcaggt gtttacagtc gagagttcag cagcttctca caaactgttg agaaaaatct 1440 
tcaaggctaa ccacttggaa gataaaatta acatcataga gaaacggccg gaattattaa 1500 
caaatgagga cctacagggc agaaaggtct ctctcctcct gggcgagccg ttcttcacta 1560 
ccagcctgct gccgtggcac aacctctact tctggtacgt gcggaccgct gtggaccagc 1620 
acctggggcc aggtgccatg gtgatgcccc aggcagcctc gctgcacgct gtggttgtgg 1680 
agttcagggt gtgcagggaa cagcaagatg tgcctcttgt tcttgctgcc acgcttccct 1740 
gtgtcctggc gggcgggtgt ggatggggct gctccttcct cacaggacct gtggcggatc 1800 
cggagcccct gtggtgactg cgaaggcttc gacgtgcaca tcatggatga catgagttag 1860 
gtaggcaggg ccacactctg catagtcccc ccgacctgct cctgtatcgc aggcctctca 1920 
cagggtccca gcttgggcag cacaggctct tctgttgggg gcagtgaggt caggtgctgc 1980 
cattttgtgt ggttcaacat gagcattgct tggtaccagc cctgttcttg gctccgtgct 2040 
gtcaccctgt gtcag 2055 

<210> 7 

<211> 1862 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 334808.1. dec 

<400> 7 

gaattccggg ccgaaggtgc ctgggcttgc tcattcagtc acagtcacag ccaccatgcc 60 
agggaggacc tgggagctgt gcctgctact gctgctgggg ctgggactgg ggtcccagga 120 
ggccctaccc ccaccctgtg agagtgagat ttactgccac ggggagctcc taaaccaagt 180 
tcaaatggcc aagctctacc aggatgacaa gcagtttgtg gacatgccac tgtctatagc 240 
tccagaacaa gtcctgcaga ccttcactga gctgtccagg gaccacaatc acagcatccc 300 
cagggagcag ctgcaggcgt ttgtccacga acacttccag gccaaggggc aggagctgca 3 60 
gccctggacc cctgcagact ggaaagacag cccccagttc ctgcagaaga tttcagatgc 420 
caaactgcgt gcctgggcag ggcagctgca tcagctctgg aagaagctgg ggaagaagat 480 
gaagccagag gttctcagcc accctgagcg gttctctctc atatactcag aacatccttt 540 
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cattgtgcct ggcggtcgct ttgttgagtt ctactactgg gactcctact gggtcatgga 600 
gggtctgctc ctctcagaga tggctgagac ggtgaagggc atgctgcaga acttcttgga 660 
cctggtgaaa acctatgggc atgtccccaa tggtgggcgc gtgtactacc tgcagcggag 720 
ccagccccca ctcttgaccc tcatgatgga ttgctacttg actcacacca atgacaccgc 780 
ctttctacag gaaaacattg aaacactagc cttggaattg gacttttgga ccaagaacag 840 
gactgtctct gtgagcttgg agggaaagaa ctacctcctg aatcgctatt atgtccctta 900 
tgggggaccc aggcctgagt cctacagcaa agatgtggag ttggctgaca ccttgccaga 960 
aggagaccgg gaggctctgt gggctgagct caaggctggg ggctgagtct ggctgggact 1020 
cctcttcacg ctggctcatt ggaggcccaa accccaactc gcttagcggc atccgaacaa 1080 
gcaaactggt gcctgttgac ctgaatgcct tcctatgcca agcagaggag ctgatgagca 1140 
acttctattc caggctgggg aacgactccc aggccacgaa gtacagaatc ctgcggtcgc 12 00 
agcgcttggc cgccctgaac acagtcctgt gggatgagca gaccggagcc tggttcgatt 1260 
acgaccttga gaagaagaag aaaaaccggg agttttaccc atccaacctc actccactct 1320 
gggccgggtg tttctctgac cctggcgtgg cggacaaggc tctgaaatac ctggaggaca 1380 
accggatcct gacttaccag tatgggatcc cgacctctct ccagaagaca ggccagcagt 1440 
gggatttccc caatgcctgg gcccccctgc aggacctggt catcagaggc ctggccaagg 1500 
cacctttacg tcgggcccag gaagtggctt tccagctggc tcagaattgg atccgaacca 1560 
attttgatgt ctactcgcag aagtcagcca tgtatgagaa gtatgacgtc agcaacggtg 1620 
gacagcccgg tgggggagga gaatatgaag ttcaggaggg atttggctgg acgaatggtg 1680 
tggtcctgat gctgctggac cgctatggtg accggctgac ctcaggggcc aagctggctt 1740 
tcctggagcc ccactgcctg gcggccaccc ttctgcccag cctcctgctc agcctcctgc 1800 
catggtgaca gccctcctct cctcacctgg ccccagctcc tgccccatta aacctctgca 1860 
cc 1862 

<210> 8 
<211> 1879 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 997089 .7 .dec 

<400> 8 

ggccgggaca cgtggtacgg aaccggcgcc gcgcttgctg ctggtaacag ggccttgcct 60 
agtgggcctt ccttcccaga accttcgaga tctgcggtct ggggtctggt tgaaagatgg 120 
cggccctcac taccctgttt aagtacatag atgaaaatca ggatcgctac attaagaaac 180 
tcgcaaaatg ggtggctatc cagagtgtgt ctgcgtggcc cggagaagag aggcgaaatc 240 
aggaggatga tggaagttgc tgctgcagat gttaagcagt tggggggctc tgtggaactg 3 00 
gtggatatcg gaaaacaaaa gctccctgat ggctcggaga tcccgctccc tcctattctg 3 60 
ctcggcaggc tgggctccga cccacagaag aagaccgtgt gcatttacgg gcacctggat 420 
gtgcagcctg cagccctgga ggacggctgg gacagcgagc ccttcaccct ggtggagcga 480 
gacggcaagc tgcatgggag aggttcgact gatgataagg gcccggtggc cggctggata 540 
aacgccctgg aagcgtatca gaaaacaggc caggagattc ctgtcaacgt ccgattctgc 600 
ctcgaaggca tggaggagtc aggctctgag ggcctagacg agctgatttt tgcccggaaa 660 
gacacattct ttaaggatgt ggactacgtc tgcatttctg acaattactg gctgggaaag 720 
aagaagccct gcatcaccta cggcctcagg ggcatttgct actttttcat cgaggtggag 780 
tgcagcaaca aagacctcca ttctggggtg tacgggggct cggtgcatga ggccatgact 840 
gatctcattt tgctgatggg ctctttggtg gacaagaggg ggaacatcct gatccccggc 900 
attaacgagg ccgtggccgc cgtcacggaa gaggagcaca agctgtacga cgacatcgac 960 
tttgacatag aggagtttgc caaggatgtg ggggcgcaga tcctcctgca cagccacaag 102 0 
aaagacatcc tcatgcaccg atggcggtac ccgtctctgt ccctccatgg catcgaaggc 1080 
gccttctctg ggtctggggc caagaccgtg attcccagga aggtggttgg caagttctcc 114 0 
atcaggctcg tgccgaacat gactcctgaa gtcgtcggcg agcaggtcac aagctaccta 1200 
actaagaagt ttgctgaact acgcagcccc aatgagttca aggtgtacat gggccacggt 1260 
gggaagccct gggtctccga cttcagtcac cctcattacc tggctgggag aagagccatg 132 0 
aagacagttt ttggtgttga gccagacttg accagggaag gcggcagtat tcccgtgacc 1380 
ttgacctttc aggaggccac gggcaagaac gtcatgctgc tgcctgtggg gtcagcggat 1440 
gacggagccc actcccagaa tgaaaagctc aacaggtata actacataga gggaaccaag 1500 
atgctggccg cgtacctgta tgaggtctcc cagctgaagg actaggccaa gccctctgtg 1560 
tgccatctcc aatgagaagg aatcctgccc tcacctcacc cttttccaac ttgcccaggg 1620 
aagtggaggt tccctctttc ctttccctct tgtcaggtca tccatgactt tagagaacag 1680 
acacaagtgt atccagctgt ccacgggtgg agctacccgt tgggcttatg agtgacctgg 174 0 
agtgacagct gagtcaccct gggtaagttc tcagagtggt caggatggct tgacctgcag 1800 
aagataccca aggtccaaaa gcacaaggtc tgcggaaagt tctggttgtc ggctggacac 1860 
cacggctcac acctataat yyy J 8 79 
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<210> 9 
<211> 1517 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 237152.1. dec 

<220> 

<221> unsure 

<222> 1059, 1076 

<223> a, t, c, g, or other 



<400> 9 

gtcccggccg cgcggagcgg acatgtgcag gctgggctag gagccgccgc ctccctcccg 60 
cccagcgatg tattcagcgc cctccgcctg cacttgcctg tgtttacact tcctgctgct 120 
gtgcttccag gtacaggtgc tggttgccga ggagaacgtg gacttccgca tccacgtgga 180 
gaaccagacg cgggctcggg acgatgtgag ccgtaagcag ctgcggctgt accagctcta 240 
cagccggacc agtgggaaac acatccaggt cctgggccgc aggatcagtg cccgcggcga 3 00 
ggatggggac aagtatgccc agctcctagt ggagacagac accttcggta gtcaagtccg 3 60 
gatcaagggc aaggagacgg aattctacct gtgcatgaac cgcaaaggca agctcgtggg 420 
gaagcccgat ggcaccagca aggagtgtgt gttcatcgag aaggttctgg agaacaacta 480 
cacggccctg atgtcggcta agtactccgg ctggtacgtg ggcttcacca agaaggggcg 540 
gccgcggaag ggccccaaga cccgggagaa ccagcaggac gtgcatttca tgaagcgcta 600 
ccccaagggg cagccggagc ttcagaagcc cttcaagtac acgacggtga ccaagaggtc 660 
ccgtcggatc cggcccacac accctgccta ggccaccccg ccgcggcccc tcaggtcgcc 720 
ctggccacac tcacactccc agaaaactgc atcagaggaa tatttttaca tgaaaaataa 780 
ggaagaagct ctatttttgt acattgtgtt taaaagaaga caaaaactga accaaaactc 840 
ttggggggag gggtgataag gattttattg ttgacttgaa acccccgatg acaaaagact 900 
cacgcaaagg gactgtagtc aacccacagg tgcttgtctc tctctaggaa cagacaactc 960 
taaactcgtc cccagaggag gacttgaatg aggaaaccaa cactttgaga aaccaaagtc 1020 
ctttttccca aaggttctga aaggaatcaa aaaaaaaanc aaaaaaaaag aaaacncaaa 1080 
gagaaagtag tactccgccc accaacaaac tccccctaac tttcccaatc ctctgttcct 1140 
gccccaaact ccaacaaaaa tcgctctctg gtttgcagtc atttatttat tgtccgctgc 1200 
aagccgcccc gagacaccgc gcagggaagg cgtgcccctg ggaattctcc gcgcctcgac 1260 
ctcccgacga cagacgcctc gtccaatcat ggtgaccctg ccttgctcgc agttctggag 1320 
gatgctgcta tcgaccttcc gtgactcacg tgacctagta caccaatgat aagggaatat 1380 
tttaaaacca gctatattat atatattata tatatataag ctatttattt cacctctctg 1440 
tatattgcag tttcatgaac caagtattac agccacaaca attaaaaaca acagacaaat 1500 
tatttaaaaa accaaaa 1517 



<210> 10 
<211> 1815 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis c_f eature 

<223> Incyte ID No: 232851. 7. dec 

<400> 10 

gaagtttacc ctgttcagca gaagctgaga tgggaacagg aaacccacag ggccccttta 60 
ttcggcaaaa atgtcagtca gcgccccggg gagcagccga gggtccctga gtgtccgcag 12 0 
caatgggcgc tgagttcctc tgctggagtt catcctgcta gctgggttcc cgagctgccg 180 
gtctgagcct gaggcatgga gcctcctgga gactgggggc ctcctccctg gagatccacc 240 
cccagaaccg acgtcttgag gctggtgctg tatctcacct tcctgggagc cccctgctac 3 00 
gccccagctc tgccgtcctg caaggaggac gagtacccag tgggctccga gtgctgcccc 3 60 
aagtgcagtc caggttatcg tgtgaaggag gcctgcgggg agctgacggg cacagtgtgt 420 
gaaccctgcc ctccaggcac ctacattgcc cacctcaatg gcctaagcaa gtgtctgcag 480 
tgccaaatgt gtgacccagc catgggcctg cgcgcgagcc ggaactgctc caggacagag 540 
aacgccgtgt gtggctgcag cccaggccac ttctgcatcg tccaggacgg ggaccactgc 600 
gccgcgtgcc gcgcttacgc cacctccagc ccgggccaga gggtgcagaa gggaggcacc 660 
gagagtcagg acaccctgtg tcagaactgc cccccgggga ccttctctcc caatgggacc 72 0 
ctggaggaat gtcagcacca gaccaaccga gcttggaaaa gtcagacaga cctctgaggt 780 
ctcatcctgg agctgccacc agcccagcct ccctgggacc tgtcttcact gcctggggcc 840 
ctgggagcca gggaggctcc ctgaggctga gtgaacactg ggcgctgcac ctgcctctcc 900 
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cacgtcctcg gccccactcc cgcaggtgca gctggctggt gacgaaggcc ggagctggga 960 

ccagcagctc ccactgggta tggtggtttc tctcagggag cctcgtcatc gtcattgttt 1020 

gctccacagt tggcctaatc atatgtgtga aaagaagaaa gccaaggggt gagcacacgg 1080 

cggccccatc agggctcatg tccccagccg tcacctcttg gagctctgtc accccaagcc 1140 

tgggaggtgg ccccagagct tttccaggat ccgcggctcc tcccagggca gccactgcag 1200 

gctggggcag gtgatgtagt caaggtgatc gtctccgtcc aggtattgat cctcctcccc 1260 

ctctccctcc cccctccacc ttcccacctc ccctctcccc gctggggctg gtgtttctgg 1320 

tgtacatggt gggggctccc agttctctga gggtcctgag tctttcaagt acagccacgg 1380 

tagctcagga aagaacccac cccctcaaac tgaaagcagt aaaatgaacc cgagaacctg 1440 

gagtcccagg ggggcctgag caggcagggt ctccacgatt cgtgtgctca cagcggaaaa 1500 

gacaggaggc agaaggtgag gccacagtca ttgaggccct gcaggcccct ccggacgtca 1560 

ccacggtggc cgtggaggag acaataccct cattcacggg gaggagccca aaccactgac 1620 

ccacagactc tgcaccccga cgccagagat acctggagcg acggctgctg aaagaggctg 1680 

tccacctggc ggaaccaccg gagcccggag gcttgggggc tccgccctgg gctggcttcc 1740 

gtctcctcca gtggagggag aggtggggcc ccctgctggg gtagagctgg ggacgccacg 1800 

tgccattccc atggc 1815 

<210> 11 

<211> 2382 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 083804,1. dec 
<220> 

<221> unsure 
<222> 2042 

<223> a, t, c, g, or other 
<400> 11 

ctccagagag gctgctgctc attgagctgc actcacatga ggatacagac tttgtgaaga 60 
aggaattggc aacactgaaa cctccagaac aaaggctgtc actaaggtcc cgctgccttg 120 
atggattata cacttgacct cagtgtgaca acagtgaccg actactacta ccctgatatc 180 
ttctcaagcc cctgtgatgc ggaacttatt cagacaaatg gcaagttgct ccttgctgtc 240 
ttttattgcc tcctgtttgt attcagtctt ctgggaaaca gcctggtcat cctggtcctt 300 
gtggtctgca agaagctgag gagcatcaca gatgtatacc tcttgaacct ggccctgtct 3 60 
gacctgcttt ttgtcttctc cttccccttt cagacctact atctgctgga ccagtgggtg 420 
tttgggactg taatgtgcaa agtggtgtct ggcttttatt acattggctt ctacagcagc 480 
atgtttttca tcaccctcat gagtgtggac aggtacctgg ctgttgtcca tgccgtgtat 540 
gccctaaagg tgaggacgat caggatgggc acaacgctgt gcctggcagt atggctaacc 600 
gccattatgg ctaccatccc attgctagtg ttttaccaag tggcctctga agatggtgtt 660 
ctacagtgtt attcatttta caatcaacag actttgaagt ggaagatctt caccaacttc 720 
aaaatgaaca ttttaggctt gttgatccca ttcaccatct ttatgttctg ctacattaaa 780 
atcctgcacc agctgaagag gtgtcaaaac cacaacaaga ccaaggccat caggttggtg 840 
ctcattgtgg tcattgcatc tttacttttc tgggtcccat tcaacgtggt tcttttcctc 900 
acttccttgc acagtatgca catcttggat ggatgtagca taagccaaca gctgacttat 9 60 
gccacccatg tcacagaaat catttccttt actcactgct gtgtgaaccc tgttatctat 1020 
gcttttgttg gggagaagtt caagaaacac ctctcagaaa tatttcagaa aagttgcagc 1080 
caaatcttca actacctagg aagacaaatg cctagggaga gctgtgaaaa gtcatcatcc 1140 
tgccagcagc actcctcccg ttcctccagc gtagactaca ttttgtgagg atcaatgaag 1200 
actaaatata aaaaacattt tcttgaatgg catgctagta gcagtgagca aaggtgtggg 1260 
tgtgaaaggt ttccaaaaaa agttcagcat gaaggatgcc atatatgttg ttgccaacac 1320 
ttggaacaca atgactaaag acatagttgt gcatgcctgg cacaacatca agcctgtgat 13 80 
tgtgtttatt gatgatgttg aacaagtggt aactttaaag gattctgtat gccaagtgaa 1440 
aaaaaaagat gtctgacctc cttacatatg caaaaatata ccttcagaga ctgtcagtag 1500 
gctggaagaa gtggatattg aagttttgac atcaatgatg aggctccagt tgtctattca 1560 
ttgactgatg gtgaaatggc tggagtgatt ctgaatcaag gtgattgtga ttatagtgac 1620 
aatgaagatg atgctattaa tactgcataa aaagtgcctg tagatgacat ggtgaaaata 1680 
tttgacaggc ttatggaagg actacagcag cacgcattca taacagaaca agaaattatg 1740 
tcagcttata aaatcaaaca gagacttcta gacaaaaacc attgttgatg aggcagatgc 1800 
ctctagaaga gacgtttaaa agccatcaaa cacaatgcct catcttccct ggaggaccca 1860 
cttcctgatc cctcaactgt gtctgatgtt tcttctcatg taagaaataa aaaataaaaa 1920 
taaaaaaata tatattggta tgtaactaca ggaaaaaaat aaaaaatata tagtggacag 1980 
taacctttca atcaaaacac agcattgtaa gtggagactg aaagactacc attgcttgtt 2040 
antgctgttg cttaacagct gatacaggta ttctggtgat gctactgtgc tgcctagtta 2100 



8/50 



WO 01/21836 



PCT/US00/25643 



cccccaacac atgatttttt cactgtaata gtggtatgtc atgttgttta ctcttaagta 2160 
cttatgtatg aataagtgta agaaaatgat tgcttatcag tagtatcaat gatttactca 2220 
atatctgaat caccttgatt cagaaccatt tcagctgttt caccatcagt caatgaataa 2280 
cagcctcatt gatgtcaaaa acttcaatat ccacttcttt cagcctactg tagactctgg 2340 
aagtatactt tttgcatatg taaggaagtc agattttttt tt ~ 2382 

<210> 12 

<211> 3385 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 272721. 6. oct 
<400> 12 

gaggaaagat ggaggtgtgg ggacaggagc tgggtgtgct ggggactggc cgcggacccc 60 
taacctgtgt ctccggtctc cctccgggag cggctcaacc cagcccatcg ctctggaccc 120 
cgttctggcc ctgcagggtg gtggttggga cgttgaaatg agcgcgcgag tggtacgtcc 180 
tctctccggc gctcacgccc cccttcctca ccgtgtttcc cgccaggacc atcagcacgt 240 
gcccatcgac atccagacca gcaagctgct cgattggctg gtggacagaa ggcactgcag 3 00 
cctgaaatgg cagagtctgg tgctgacgat ccgcgagaag atcaatgctg ccatccagga 360 
catgccagag agcgaagaga tcgcccagct gctgtctggg tcctacattc actactttca 420 
ctgcctaaga atcctggacc ttctcaaagg cacagaggcc tccacgaaga atatttttgg 480 
ccgatactct tcacagcgga tgaaggattg gcaggagatt atagctctgt atgagaagga 540 
caacacctac ttaggtaaag tggcccggcc tgggagccct ggtatccatg gggaagccca 600 
ctctcagagt tctgagatac caggcttata ggaggcacag tctgtgagtg ggaagagact 660 
ggagtgtaga tgttgcccat ttgtaggtgg taaaatcaat tgtttttgat ggaattgatt 72 0 
ttccctgagt ggagtgctgg gggaaggagg aggtccaggc cggtagtggc cattcgccgt 780 
gcctcagcga gcaggtgtgt gtgggtcctc caccactcac ctcttggtta gcgggagtgt 840 
gctgccccca cccccacccc cgcaccccca ttctacacaa ggcagaagag gcacgggttt 900 
tcctgggagc gaatatcaag tgcctgagag caactacagg actaactgtg tttgggttgg 960 
gtgtagtata aataataata atggctaata tttcctgagc atctactaaa tgcaaggaat 1020 
tgtgcttggt gtgtcatgtg gattctctct tgcatcttca tgataaatgt tattgtcgct 1080 
gttttaccga tgagggttgg attagagggg ttaaacaact tgtccttagg ctccacagct 1140 
gggaacaagt ggggctggga agctgacttc gtgctcttca ccaccacaaa ggatgtgtgt 1200 
gcatcctggg gcatgcctgc ctcatgtggg ggtgtcctgg gctgaatttc ctgggcactt 1260 
ctcagtggaa ctctctagcc tcctggttcg gaatgtcaac tatgagatcc cctcactgaa 1320 
gaagcagatt gccaagtgcc agcagctgca gcaagaatac agccgcaagg aggaggagtg 1380 
ccaggcaggg gctgccgaga tgcgggagca gttctaccac tcctgcaagc agtatggcat 1440 
cacgggcgaa aatgtccgag gagaactgct ggccctggtg aaggacctgc cgagtcagct 1500 
ggctgagatt ggggcagcgg ctcagcagtc cctgggggaa gccattgacg tgtaccaggc 1560 
gtctgtgggg tttgtgtgtg agaggtagag aggcctcaag cttctcctgg tgggggtgct 1620 
ttgcctgtgt tccccagctc atgacccttc tccagttgtc ttgttcccat ataacatttg 1680 
aactctttac acadctgaac ctgtgggggc cttgcccatt tgaccatgtg gcccaggcca 1740 
aagcccagtg ttggccttac gcatggttcg gcaggagagt cagttgtgtg ctctgttgaa 1800 
gccccacaga gcaggtgttg ccaatgctgc ggttcgtgca gaagcgggga aactcaacgg 1860 
tgtacgagtg gaggacaggg acagagccct ctgtggtgga acgaccccac ctcgaggagc 1920 
ttcctgagca ggtggcagaa gatgcgattg actggggcga ctttggggta gaggcagtgt 1980 
ctgaggggac tgactctggg catctctgcc gaggctgctg gaatcgactg gggcatcttc 2040 
ccggaatcag attcaaagga tcctggaggt gatgggatag actggggaga cgatgctgtt 2100 
gctttgcaga tgcacagtgc tggaagcagg aacccaggct ccagaaggtg ttgccagggg 2160 
cccagatgcc ctgacactgc ttgaatacac tgagacccgg aatcagttcc ttgatgagct 2220 
catggagctt gagatcttct tagcccagag agcagtggag ttgagtgagg aggcagatgt 2280 
cctgtctgtg agccagttcc agctggctcc agccatcctg cagggccaga ccaaagagaa 2340 
gatggttacc atggtgtcag tgctggagga tctgattggc aagcttacca gtcttcagct 2400 
gcaacacctg tttatgatcc tggcctcacc aaggtatgtg gaccgagtga ctgaattcct 2460 
ccagcaaaag ctgaagcagt cccagctgct ggctttgaag aaagagctga tggtgcagaa 2520 
gcagcaggag gcacttgagg agcaggcggc tctggagcct aagctggacc tgctactgga 2580 
gaagaccaag gagctgcaga agctgattga agctgacatc tccaagaggt acagcgggcg 2640 
ccctgtgaac ctgatgggaa cctctctgtg acaccctccg tgttcttgcc tgcccatctt 2700 
ctccgctttt gggatgaaga tgatagccag ggctgttgtt ttggggccct tcaaggcaaa 27 60 
agaccaggct gactggaaga tggaaagcca caggaaggaa gcggcacctg atggtgatct 2820 
tggcactctc catgttctct acaagaagct gtggtgattg gccctgtggt ctatcaggcg 2880 
aaaaccacag attcctcttt tattaagtcc gctatactaa ctagaaggag aatctgtggt 2940 
tttcgcctga tagaccacag ggccaatcac cacagcttct tgtagagaac atggagagtg 3000 
ccaagatcac catcaggtgc cgcttccttc ctgtggcttt ccatcttcca gtcagcctgg 30 60 
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tcttttgcct cttggagatg tcagcttcaa tcagcttctg cagctccttg gtcttctcca 3120 

gtagcaggtc cagcttaggc tccagagccg cctgctcctc aagtgcctcc tgctgcttct 3180 

gcaccatcag ctctttcttc aaagccagca gctgggactg cttcagcttt tgctggagga 3240 

attcagtcac tcggtccaca taccttggtg aggccaggat cataaacagg tgttgcagct 33 00 

gaagactggt aagcttgcca atcagatctc cagcactgac accatggtaa ccatcttctc 33 60 
tttggtctgg cctgcaagga tggct 33 85 

<210> 13 

<211> 3111 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 461603. 4. oct 
<220> 

<221> unsure 

<222> 1605, 2212 

<223> a, t, c, g, or other 

<400> 13 

ttttactgta caaatgcttt atttctattc aatatttaga agacagttat aaacaagatg 60 

cattcaatag catggtggca gatgaacatc aggaaggaac atccatgagc ttccatccac 12 0 

ggaacctcac catggatacg cttgtgatca agggcctggt ctcccctcaa gacacggtca 180 

cagatcagag gccacaccat cctagcagtg gagcaggacc agctgggaca gggtccttct 240 

gtgacacctg ctgcatcacc aggctgggtg aacggacaca attgccagaa ctcacagaat 3 00 

agaagtatca gcaccgaaac ctcacaggaa aaatggtaag ttctaagttt ctccattaat 3 60 

agtaactctc agattaatct ctgtcatcca tcgcttctcc aagaaatgac tttttagggt 42 0 

gatgtgccag gcgccatgtt ggagggctgg tggtagcggc ttggggaggt gctcactctg 480 

tcggtctcac tctctcacac gcttcccctg gctcccttcg ttccccccca ccccacttgg 540 

cctgcgtgct ggagggtgtg cgagggagtg ggagggcgtc ggggggtggg gggaggcgtt 600 

ccggtcccca agagacccgc ggagggaggc ggaggctgtg agggactccg ggaagccatg 660 
gacgtcgaga ggctccagga ggcgctggaa gattttgaga agagggggaa aaaggaagtt 720 

tgtcctgtcc tggatcagtt tctttgtcat gtagccaaga ctggagaaac aatgattcag 780 
tggtcccaat ttaaaggcta ttttattttc aaactggaga aagtgatgga tgatttcaga 840 

acttcagctc ctgcgccaag aggtcctccc aaccctaatg tcgaatatat tccctttgat 9 00 

gaaacaaagg gaagaatact gaaaactgtc actgggattt aatggtatcc cttttactat 9 60 

tcagcgacta tgtgaattgt taacagatcc aaggagaaac tatacaggaa cagacaaatt 1020 

tctcagagga gtagaaaaga atgtgatggt tgttagctgt gtttatcctt cttcagagaa 1080 

aaacaattcc aatagtttaa atcgaatgaa tggtgtgatg tttcctggaa attcaccaag 1140 

ctatactgag aggtctaata taaatgggcc tgggacaccc agcaacctta atcgaccaaa 1200 

ggtttctttg tcagcccccc atgaccaaca aatgggttgc ctgagagcac agacagcaaa 1260 

aaggaaggcc aaaaaatttt tgcccccagg ccaaaaaaaa ttggaaggga agaaaaaaaa 13 20 

aatcacagtg actcttcgac ctctgaatca gaagtttcct cagtgagccc tttgaaaaat 13 80 

aaacatccag atgaagatgc tgtggaagct gaggggcatg aggtaaaaag actcaggttt 1440 

gacaaagaag gtgaagtcag agaaacagcc agtcaaacga cttccagcga aatttcttca 1500 

gttatggtag gagaaacaga agcatcatct tcatctcagg ataaagacaa agatagccgt 1560 

tgtacccggc agcactgtac agaagaggat gaagaagagg atgangagga agaagaagag 1620 

tcttttatga catcaagaga aatgatccca gaaagaaaaa atcaagaaaa agaatctgat 1680 

gatgccttaa ctgtgaatga agagacttct gaggaaaata atcaaatgga ggaatctgat 1740 

gtgtctcaag ctgagaaaga tttgctacat tctgaaggta gtgaaaacga aggccctgta 1800 

agtagtagtt cttctgactg ccgtgaaaca gaagaattag taggatccaa ttccagtaaa 1860 

actggagaga ttctttcaga atcatccatg gaaaatgatg acgaagccac agaagtcacc 1920 

gatgaaccaa tggaacaaga ctaactattt agaaacattt agatgcagta ttttacatac 1980 

agttctggtt ttaacactgt ataaaacttt tgtgtaataa aatggacctt tagttttaca 2040 

agagaagcag gttgtaaaat aaagtacttt atggataatt cctgaaagag ttgtacatgt 2100 

aagaactgtg aatatcagct cctctgggtc ctgcttacct taccgctgac ttttctttct 2160 

ttcttttttt ggtctgggca aatcagtggt ttgtgtatag attttttttt tnctttttaa 2220 

tttaggattg aagtttttaa actggaaggt aattacaatt ttgaaaagtt ttttgagatt 2280 

atcacattta gtttatacat atgcaagaag ctttttgtct tgtctctttc tgatagctct 2340 

agcagttttc atattttggt catagtttca acattttaac atgtgaataa tagagtttca 2400 

tgctggtttc cagatgttat tgttcagcta catacaatgg aacattaagt tatattctaa 2460 

ggggggaaat gttatatttt tctgtttcta taagagatga atacagtgga tactttttct 2520 

attggtaatg attgagttca cctctttcag aagacatttt ctttctcttc tgagtaactg 2580 

aaataaaatc tggcctttgt gaaaccctgg aaataccacg acccacaact agaaacacca 2640 

ataccagctc ctccgcgagt ttccagctcc acaacctaag acatcagagg cagcattggt 27 00 
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tcctcacgta gagtccagct ccgggaccct catatttgaa ccgcagggcc atctcatccc 2760 
tggatctcca gctgcaccac actcaaatta gaacaacatc agttcctccc caggtctcca 2820 
cctgcacagc cctcgaaagg gaacgtcagc tcctccccgg gtctccagct gtaggtccct 2880 
aaaactagaa catcagctcc cgcctgggtc gccagcagca ccaccctcaa actggaacat 2940 
cagatcccca cgggtctcca gctgcagggc cctcaaactg gaacatcagc tccccaccag 3 000 
atctccagct gcacggacct caaactggaa catcagctcc ccgccgggtc tccagctgca 3 060 
ctgcctgcaa actggaacat gagctccctg ccgggtctcc agctgcatgg t 3111 

<210> 14 

<211> 2980 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> IncytG ID No: 332465. 2. dec 
<400> 14 

agcgacgggg aaattcaaac gtgtttgcgg aaaggagttt gggttccatc ttttcatttc 60 
cccagcgcag tttctgtaga aatggaatcc gaggatttaa gtggcagaga attgacaatt 120 
gattccataa tgaacaaagt gagagacatt aaaaataagt ttaaaaatga agaccttact 180 
gatgaactaa gcttgaataa aatttctgct gatactacag ataactcggg aactgttaac 240 
caaattatga tgatggcaaa caacccagag gactggttga gtttgttgct caaactagag 3 00 
aaaaacagtg ttccgctaag tgatgctctt ttaaataaat tgattggtcg ttacagtcaa 3 60 
gcaattgaag cgcttccccc agataaatat ggccaaaatg agagttttgc tagaattcaa 420 
gtgagatttg ctgaattaaa agctattcaa gagccagatg atgcacgtga ctactttcaa 480 
atggccagag caaactgcaa gaaatttgct tttgttcata tatcttttgc acaatttgaa 540 
ctgtcacaag gtaatgtcaa aaaaagtaaa caacttcttc aaaaagctgt agaacgtgga 600 
gcagtaccac tagaaatgct ggaaattgcc ctgcggaatt taaacctcca aaaaaagcag 660 
ctgctttcag aggaggaaaa gaagaattta tcagcatcta cggtattaac tgcccaagaa 720 
tcattttccg gttcacttgg gcatttacag aataggaaca acagttgtga ttccagagga 780 
cagactacta aagccaggtt tttatatgga gagaacatgc caccacaaga tgcagaaata 840 
ggttaccgga attcattgag acaaactaac aaaactaaac agtcatgccc atttggaaga 900 
gtcccagtta accttctaaa tagcccagat tgtgatgtga agacagatga ttcagttgta 960 
ccttgtttta tgaaaagaca aacctctaga tcagaatgcc gagatttggt tgtgcctgga 102 0 
tctaaaccaa gtggaaatga ttcctgtgaa ttaagaaatt taaagtctgt tcaaaatagt 1080 
catttcaagg aacctctggt gtcagatgaa aagagttctg aacttattat tactgattca 1140 
ataaccctga agaataaaac ggaatcaagt cttctagcta aattagaaga aactaaagag 1200 
tatcaagaac cagaggttcc agagagtaac cagaaacagt ggcaatctaa gagaaagtca 1260 
gagtgtatta accagaatcc tgctgcatct tcaaatcact ggcagattcc ggagttagcc 1320 
cgaaaagtta atacagagca gaaacatacc acttttgagc aacctgtctt ttcagtttca 1380 
aaacagtcac caccaatatc aacatctaaa tggtttgacc caaaatctat ttgtaagaca 1440 
ccaagcagca ataccttgga tgattacatg agctgtttta gaactccagt tgtaaagaat 1500 
gactttccac ctgcttgtca gttgtcaaca ccttatggcc aacctgcctg tttccagcag 1560 
caacagcatc aaatacttgc cactccactt caaaatttac aggttttagc atcttcttca 1620 
gcaaatgaat gcatttcggt taaaggaaga atttattcca tattaaagca gataggaagt 1680 
ggaggttcaa gcaaggtatt tcaggtgtta aatgaaaaga aacagatata tgctataaaa 1740 
tatgtgaact tagaagaagc agataaccaa actcttgata gttaccggaa cgaaatagct 1800 
tatttgaata aactacaaca acacagtgat aagatcatcc gactttatga ttatgaaatc 1860 
acggaccagt acatctacat ggtaatggag tgtggaaata ttgatcttaa tagttggctt 1920 
aaaaagaaaa aatccattga tccatgggaa cgcaagagtt actggaaaaa tatgttagag 1980 
gcagttcaca caatccatca acatggcatt gttcacagtg atcttaaacc agctaacttt 2040 
ctgatagttg atggaatgct aaagctaatt gattttggga ttgcaaacca aatgcaacca 2100 
gatacaacaa gtgttgttaa agattctcag gttggcacag ttaattatat gccaccagaa 2160 
gcaatcaaag atatgtcttc ctccagagag aatgggaaat ctaagtcaaa gataagcccc 2220 
aaaagtgatg tttggtcctt aggatgtatt ttgtactata tgacttacgg gaaaacacca 2280 
tttcagcaga taattaatca gatttctaaa ttacatgcca taattgatcc taatcatgaa 2340 
attgaatttc ccgatattcc agagaaagat cttcaagatg tgttaaagtg ttgtttaaaa 2400 
agggacccaa aacagaggat atccattcct gagctcctgg ctcatccata tgttcaaatt 2460 
caaactcatc cagttaacca aatggccaag ggaaccactg aagaaatgaa atatgttctg 2520 
ggccaacttg ttggtctgaa ttctcctaac tccattttga aagctgctaa aactttatat 2580 
gaacactata gtggtggtga aagtcataat tcttcatcct ccaagacttt tgaaaaaaaa 2640 
aggggaaaaa aatgatttgc agttattcgt aatgtcagat accacctata aaatatattg 2700 
gactgttata ctcttgaatc cctgtggaaa tctacatttg aagacaacat cactctgaag 2760 
tgt tat cage aaaaaaaatt cagtagatta tctttaaaag aaaactgtaa aaatagcaac 2820 
cacttatggc actgtatata ttgtagactt gttttctctg ttttatgetc ttgtgtaatc 2880 
tacttgacat cattttactc ttggaatagt gggtggatag caagtatatt ctaaaaaact 2940 
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ttgtaaataa agttttgtgg ctaaaatgac actaacattt 2980 

<210> 15 

<211> 2070 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 44517 5. 3. dec 



<400> 15 

cccccctccc tgttttccgt taggaacccg gcgaggaaat acatgcactg gctgagaatc 60 
gcccgcgcca gggcgcaacg ccacaaggtg tagggagtgt gcggggtggg gcgaaagggg 12 0 
acccaagagt ccctgtggct cggagtgccg ggccgtcggt tcttcattcc tgccctcggg 180 
gcagacggag tgaccccggc ccccactccc cgccccgacc atggtagtgt tcaatggcct 24 0 
tcttaagatc aaaatctgcg aggccgtgag cttgaagccc acagcctggt cgctgcgcca 3 00 
tgcggtggga ccccggccgc agactttcct tctcgacccc tacattgccc tcaatgtgga 3 60 
cgactcgcgc atcggccaaa cggccaccaa gcagaagacc aacagcccgg cctggcacga 42 0 
cgagttcgtc accgatgtgt gcaacggacg caagatcgag ctggctgtct ttcacgatgc 480 
ccccataggc tacgacgact tcgtggccaa ctgcaccatc cagtttgagg agctgctgca 540 
gaacgggagc cgccacttcg aggactggat tgatctggag ccagaaggaa gagtgtatgt 600 
gatcatcgat ctctcagggt cgtcgggtga agcccctaaa gacaatgaag agcgtgtgtt 660 
cagggaacgc atgcggccga ggaagcggca gggggccgtc aggcgcaggg tccatcaggt 720 
caacggccac aagttcatgg ccacctatct tcggcagccc acctactgct cccattgcag 780 
agacttcatc tggggtgtca taggaaagca gggataccag tgtcaagtct gcacctgcgt 840 
ggtccacaag cggtgccacg agctcataat cacaaagtgt gctgggttaa agaagcagga 900 
gacccccgac caggtgggct cccagcggtt cagcgtcaac atgccccaca agttcggtat 960 
ccacaactac aaggtcccta ccttctgcga tcactgtggg tccctgctct ggggactctt 1020 
gcggcagggt ttgcagtgta aagtctgcaa aatgaatgtt caccgtcgat gtgagaccaa 1080 
cgtggctccc aactgtggag tggatgccag aggaatcgcc aaagtactgg ccgacctggg 1140 
cgttacccca gacaaaatca ccaacagcgg ccagagaagg aaaaagctca ttgctggtgc 1200 
cgagtccccg cagcctgctt ctggaagctc accatctgag gaagatcgat ccaagtcagc 1260 
acccacctcc ccttgtgacc aggaaataaa agaacttgag aacaacattc ggaaagcctt 13 20 
gtcatttgac aaccgaggag aggagcaccg ggcagcatcg tctcctgatg gccagctgat 1380 
gagccccggt gagaatggcg aagtccggca aggccaggcc aagcgcctgg gcctggatga 1440 
gttcaacttc atcaaggtgt tgggcaaagg cagctttggc aaggtcatgt tggcagaact 1500 
caagggcaaa gatgaagtat atgctgtgaa ggtcttaaag aaggacgtca tccttcagga 1560 
tgatgacgtg gactgcacaa tgacagagaa gaggattttg gctctggcac ggaaacaccc 1620 
gtaccttacc caactctact gctgcttcca gaccaaggta tgttaggaag aagctggctg 1680 
gctgccatgt tggggcatct tgactatcag ataaaatacc aattttagac cctctacatt 1740 
gttctctcaa agactttgta aagtgggatg ggttttaccc ttgaaaagat caggatgtat 1800 
ttgaacagca tcttcttttt tagggcacag gattttccat tcaaggttgt gcttgtgaag 1860 
ggatgagaga gctgtagaat tctttgcagc cagagttgga caaagccaaa tggctaaact 192 0 
cactgtttgc tcattggaaa aaccaacaag tgtggttagt ccttgctctg ctcctaactt 1980 
tctatcacta atcaaatcac ttaacttcta agttattttc atgtcatatg aaaaagagta 2040 
aaggccacct taatctcttc taatctgtgt 2070 



<210> 16 
<211> 2923 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> mi sc_f eature 

<223> Incyte ID No: 980541.1. dec 



<400> 16 

tgccgtgaag attacaaatg agactgggaa 
gattgtgtcc tgagcccgca gtcaggctga 
aggagtctgt tccccgaacc tgtgtggacc 
acacatccct ggtcagctgt aggaaagcca 
attttgcaaa cacacaagcc ctctgcattt 
atttcctttg aaacacctgg ggtgaggaaa 
gaatctatga ctgaagagga tcggctaaga 
tttcacactc tgtcttaaaa tcagaagttg 
gtcatgggat gcagcagttc ttcaacgaaa 



accctcttca ataagacctg tgtgatgata 60 
aagagtcaac aaccagcaaa gtgaagatct 120 
tgatcaaacc tcgagggaaa ggctgggaga 18 0 
gagagcattt gagaagaggc tgaagcttga 240 
ccccagagag aaggtttttt tctcgtcttc 300 
aacctctaca ttggccaagc agaagaaaca 3 60 
gtggttcctc gcagcttaaa gggaggcact 420 
aattcatgaa cacatatgat ttagatagaa 480 
accaggagat ctgacacatc actgagagct 540 
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gcgttgatca tccagaactg gtaccgaggt tacaaagctc gactgaaggc cagacaacac 600 
tatgccctca ccatcttcca gtccatcgaa tatgctgatg aacaaggcca aatgcagtta 660 
tccaccttct tttccttcat gttggaaaac tacacacata tacataagga agagctagaa 720 
ttaagaaatc agtctcttga aagcgaacag gacatgaggg atagatggga ttatgtggac 780 
tcgatagatg tcccagactc ctataatggt cctcggctac aatttcctct cacttgtacg 840 
gatattgatt tacttcttga ggccttcaag gaacaacaga tacttcatgc ccattatgtc 9 00 
ttagaggtgc tatttgaaac caagaaagtc ctgaagcaaa tgccgaattt cactcacata 9 60 
caaacttctc cctccaaaga ggtaacaatc tgtggtgatt tgcatgggaa actggatgat 1020 
ctttttttga tcttctacaa gaatggtctc ccctcagaga ggaacccgta tgtttttaat 1080 
ggtgactttg tagatcgagg aaagaattcc atagagatcc taatgatcct gtgtgtgagt 1140 
tttcttgtct accccaatga cctgcacttg aacagaggga accacgaaga ttttatgatg 1200 
aatctgaggt atggcttcac gaaagaaatt ttgcataaat ataagctaca tggaaaaaga 1260 
atcttacaaa tcttggaaga attctatgcc tggctcccaa tcggtacaat cgttgacaat 1320 
gaaatcctgg tcatccatgg tgggatatca gagaccacag acttgaattt actccaccgt 13 80 
gtagagagga acaagatgaa atctgtgctg ataccaccaa cggaaacaaa cagagaccat 1440 
gacactgact cgaagcacaa taaagtaggt gtgactttta atgcacatgg aagaatcaaa 1500 
acaaatggat ctcctactga acacttaaca gagcatgaat gggaacagat tattgatatt 1560 
ctgtggagtg atcccagagg caaaaatggc tgttttccaa atacgtgccg aggagggggc 1620 
tgctattttg gaccagatgt tacttccaag attcttaata aataccagtt gaagatgctc 1680 
atcaggtctc atgaatgtaa gcccgaaggg tatgaaatct gtcatgatgg gaaggtggtg 1740 
actatatttt ctgcttctaa ttattatgaa gaaggcagca atcgaggagc ttacatcaaa 1800 
ctatgttcta gtacaactcc tcgatttttc cagtaccaag taactaaagc aacgtgcttt 1860 
cagcctcttc gccaaagagt ggatactatg gaaaacagcg ccatcaagat attaagagag 1920 
agagtgattt cacgaaaaag tgaccttact cgtgctttcc aacttcaaga ccacagaaaa 1980 
tcaggaaaac tttctgtgag ccagtgggct ttttgcatgg agaacatttt ggggctgaac 2040 
ttaccatgga gatccctcag ttcgaatctg gtaaacatag accaaaatgg aaacgttgaa 2100 
tacatgtcca gcttccagaa tatccgcatt gaaaaacctg tacaagaggc teat tc tact 2160 
ctagttgaaa ctctgtacag atacagatct gacctggaaa tcatatttaa tgecattgae 2220 
actgatcact caggectgat ctccgtggaa gaatttcgtg ccatgtggaa actttttagt 2280 
tctcactaca atgttcacat tgatgattcc caagtcaata agettgecaa cataatggac 2340 
ttgaacaaag atggaagcat tgactttaat gagtttttaa aggctttcta tgtagtgcat 2400 
agatatgaag acttgatgaa acctgatgtc accaaccttg gctaaacaca aatgagagct 2460 
tccctcaggc tccctgaaac agctaggccc aaatcacaag tacagtcctt tccaacaccc 2520 
ctgaaattca tagtcagtag cagagaaaag cagatcccaa ttcatcccca tcccacaaac 2580 
agatgeatag tatgggtttt ggaagtccct agcaagctgt tattggtaag attaggttaa 2640 
atgtcagtaa taggatttgg tttcagcatt agtacctaca tattgecagt gagaaactgg 27 00 
gttggaccta gtggtgttgt cgtgagtgcc acctaaccag gaggecagag cggtttgaaa 2760 
acatcctgaa aggaactcat acagcacaag agaaaactac taagcttgac atctgtgagt 2820 
gactgaggga gacaggagga ataccaggtt attcatggaa taaagtcttt ccatctttaa 2880 
actgtgatct tctttggaga ttttataagc cagtgatcct caa 2923 

<210> 17 

<211> 802 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 237996.1. dec 

<220> 

<221> unsure 
<222> 632 

<223> a, t, c, g, or other 
<400> 17 

cgtggctgag ccagcagctg cagcagctac gggagtggcc gggtggccgg cgggtgccag 60 
ccgccatgga ggccgtgccc cgcatgccca tgatctggct ggacctgaag gaggceggtg 120 
actttcactt ccagccagct gtgaagaagt ttgtcctgaa gaattatgga gagaacccag 180 
aagcctacaa tgaagaactg aagaagctgg agttgctcag acagaatget gtccgtgtcc 240 
cacgagactt tgagggctgt agtgtcctcc gcaagtacct cggccagctt cattacctgc 3 00 
agagtegggt ccccatgggc tegggecagg aggcegctgt ccctgtcacc tggacagaga 3 60 
tcttctcagg caagtctgtg geccatgagg acatcaagta cgagcaggcc tgtattctct 420 
acaaccttgg agcgctgcac tecatgetgg gggccatgga caagegggtg tctgaggagg 480 
gcatgaaggt ctcctgtacc catttccagt gcgcaccggc gccttcgcct acctaeggga 540 
gcacttccct caagcctaca gegtcgacat gagccgccag atccttacgc teaaegtcaa 600 
cctcatgctg ggccaggctc aggagtgect cntggagaag tcgatgttgg acaacaggaa 660 
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gagctttctg gtggcccgca tcagtgcaca ggtggtagat tactacaagg aggcatgccg 720 
ggccttggag aaccccgaca ctgcctcact gctgggccgg atccagaagg actggaagaa 780 
acttgtgcca gatgaagatc ta 802 

<210> 18 

<211> 667 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2432 67. 9. dec 
<400> 18 

aatgtgcatg ggatcactag catgtctgcg gagagcggcc ctggggacga gattgagaaa 60 
tctgccagta atgggggatg gactagaaac ttcccaaatg tctacaacac aggcccaggc 120 
ccaaccccag ccagccaacg cagccagcac caaccccccg cccccagaga cctccaaccc 180 
taacaagccc aagaggcaga ccaaccaact gcaatacctg ctcagagtgg tgctcaagac 24 0 
actatggaaa caccagtttg catggccttt ccagcagcct gtggatgccg tcaagctgaa 3 00 
cctccctgat tactataaga tcattaaaac gcctatggat atgggaacaa taaagaagcg 3 60 
cttggaaaac aactattact ggaatgctca ggaatgtatc caggacttca acactatgtt 420 
tacaaattgt tacatctaca acaagcctgg agatgacata gtcttaatgg cagaagctct 480 
ggaaaagctc ttcttgcaaa aaataaatga gctacccaca gaagaaaccg agatcatgat 540 
agtccaggca aaaggaagag gacgtgggag aaaagaaagt gggttatcaa gggtgatttg 600 
aaattttctg cagcattaaa gctggcgctt aataagaata agtaataata aagaaatttc 660 
taacaaa 667 



<210> 19 

<211> 1973 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 242082 . 10 . dec 



<400> 19 

gccatcgtga aaaagaaagc tgagctcatt aaagggaatt acaagtgcaa cgtgtgctct 60 
cgaaccttct tctccgaaaa tggcctccgg gaacatatgc agacccacct aggccctgtc 120 
aaacactaca tgtgccctat ttgcggagag cggtttccct cccttttaac tcttactgaa 180 
cacaaagtca cgcatagtaa gagtcttgat actggaaact gccggatttg caagatgcct 240 
ctccagagtg aagaggagtt tttagagcat tgccaaatgc accctgactt gaggaattcc 3 00 
ctgacaggct ttcgctgcgt ggtgtgcatg cagacagtga cctccacctt ggaactcaaa 360 
atccatggga cgttccacat gcaaaagaca gggaatgggt ctgcagttca gaccacaggg 420 
cggggccagc acgtccaaaa actgtataag tgcgcatctt gcctcaaaga attccgttcc 480 
aagcaagatc tggtgaaact tgatatcaat ggcctgccat atggtctgtg tgccggctgc 540 
gtgaatctca gtaagagcgc caccccaggc attaacgtcc ctcccggcac gaatagacca 600 
ggcttgggcc agaatgagaa tctgagtgcc attgagggga aaggcaaggt ggggggactg 660 
aagacacgct gctctagctg caacgttaag tttgagtctg aaagtgaact ccagaaccac 720 
atccaaacca tccaccgaga gctcgtgcca gacagcaaca gcacacagtt gaaaacgccc 780 
caagtatcac caatgcccag aatcagtccc tcccagtcgg atgagaagaa gacctatcaa 840 
tgcatcaaat gtcagatggt tttctacaat gaatgggata ttcaggttca tgttgcaaat 900 
cacatgattg atgaaggact gaaccatgaa tgcaaactct gcagccagac ctttgactct 960 
cc tgccaaac tccagtgcca cctgatagag cacagcttcg aagggatggg aggcaccttc 1020 
aagtgtccag tctgctttac agtatttgtt caagcaaaca agttgcagca gcatattttc 1080 
tctgcccatg gacaagaaga caagatctat gactgtacac aatgtccaca gaagtttttc 1140 
ttccaaacag agctgcagaa tcatacaatg acccaacaca gcagttagtg caagtacagt 1200 
ctctcaagga gaattgattt tgtggcacaa aaagggaaca tgttttactc tttgcacgaa 1260 
actttcattg ttaatgtata ttattcagaa acattgtatt gtaccataaa acttgtatta 1320 
tcaaactgtt ggatgttcat gtgtttgaac ttttgcgcac cggatagacc ccttgtatat 13 80 
aaagtgttgc acatgtatta tgtcgtctga tactaaaatg gtcttataaa gacaagtgga 1440 
cttgggccct attcaggcaa gattaaaaaa aaaaaaagac tatgaccaaa atggcttaag 1500 
ataaagtatt tttaaggaag aaagattaaa aacaactgtt atacatgaga ctatggttgg 1560 
acttcctttt ctttacactt aagcctagaa tttctcttta ggtatatcag cgcttaaatc 1620 
caagactatt ttttattgct gaagattctt gcaaaccatg aagagatgtt ctcacagaac 1680 
agaaccccac agctggataa ggcccgtata tatatatttg taagccttgc aatgtgacag 1740 
gtagcatcac tatatatgca atagttgtta tgtagactgt caaagaattt ttttttccct 1800 
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ggatacattt gaagctttga gtgttcaagg ttttccttaa tgatttcacg cagccaaatt 1860 
cttgaatcag ttgaactaac ctgtatgtta ctgttattaa tgtttactct gcagtctgaa 1920 
cctggagatt actggaattg ttttccaaga ggaaataaat tcagtttacc att 1973 

<210> 20 

<211> 2328 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 01923 9.1. dec 
<220> 

<221> unsure 
<222> 360 

<223> a, t, c, g, or other 



<400> 20 

tgaatgtgat gtttgcagaa aagccttcag 
agtacattct ggagaaaagc cttttaagtg 
tatacacctt gccagtcatt taaggattca 
gtgtggaaaa tccttcagca tcagttctca 
agagaagccc tatgaatgta aggtttgtag 
acagcatcag aaaacccata caggagagaa 
cttcagccag accacacacc tcattccaac 
ataaatgtat ggaatgtggg aaggcctttg 
gactgcacac tgggccaaag accttatgaa 
aaatcctccc ttatttgtca tcgcagaagt 
gtgtgtggca aagcctttag tcatcgtcaa 
ggaaagaaac catatgaatg taaggaatgt 
aatcaacata agagagttca tactggagag 
gtcttcaggc aaactgctca cttagctcat 
acatgcccct ctttaccttc cacgtcaaat 
aatccatcct ccctcccatc accatagcct 
cagtttaaaa ccccatctcc ctgccctttt 
ttcacataag tgtaaatgta acttattcac 
ttggttaatg tgtgagatct gctcagcaca 
acttagctgt tttaaaaact ttgtatttga 
taaaaaaatg atgcagtagg gtgagggtag 
ttctccagta gtgggtgagg ttttgccttt 
aagttagttt tgtggctttc catctgaccc 
ggtgagaaaa ttaaacttta catttgactt 
atgaataggc cagctgggac aaatgaattt 
catgaagtta aaacaactga tgttactcaa 
tttaaaaatc acatttataa gtgaaccgta 
ggtaaactga tccagaatag gggtcagcaa 
tgtttttgta tggtccatga tctagaattt 
aaagaatatt ttcaacatga aaattatatg 
ttttgttggc ttcagccaca cagtgattta 
cagcagagtt gagcagttgt gacaggagac 
gtctgatcct acacagaaaa tgtgtgtgat 
aagcccatgg accaaatcct atctgctgct 
cagccatgct tacttgttta gctattgact 
tatggctcgc aaaactgcaa atatttccta 
tctagagtag agaaaaatgt ataaaagatt 
atcagaaaca agaaaacaaa tgataaagga 
tattaaacta tattgaaaaa taaagatgtc 



ccatcatgca tcactcactc aacatcaaag 60 
taaagagtgc ggaaaagctt ttaggcagaa 120 
tactggggag aagccttttg aatgtgtgga 180 
gcttgccact catcagagaa tccatactgg 240 
taaagcgttc acccagaagg ctcacctttc 300 
accatatgag tgcaaggaat gcggtaaaan 3 60 
atcagagagt tcccactggt gagaaaccct 420 
gtgataactc atcctgtact caacatcaaa 480 
tgtattgagt gtggaaaggc attcaagaca 540 
catactggag aaaaacctta tgaatgcagt 600 
tcccttagtg tacatcagag aatccattct 660 
aggaaaacct tcatccaaat tggacacctt 720 
agatcttata actataagaa aagcagaaaa 780 
catcagcgaa ttcatactgg agagtcgtca 840 
cctgtggatc tgtttcccaa atttctctgg 900 
cgagacgtca tttctgtttg actactccag 960 
gttttctttt tgtcccttat tagttagttc 1020 
tcctcttgta aaacttatag tttctttaaa 1080 
gtgcctggtc ccatagtaag tgctcagtaa 1140 
acattgaaaa gttacagtag tcagctctga 1200 
gaaaaagcac attttctatc aggaacagaa 1260 
gttggtttta aaacttgatt ctataatgcc 1320 
tatgtgaatg taaggtgatg tgaccttgtt 1380 
gatttgtttt agaaagtcta gggaccatga 1440 
aaaaaatcag aaaaatgcaa gatttatatg 1500 
gaattagaaa actttgcaag atttgacttg 1560 
ttaaaacttt taaggaacca ttcattgtga 1620 
actatgactc atggccacag tctcactgac 1680 
aaaaaaattt taaagggttg aaaaaagtga 1740 
aaatttaagt tttggtgtct gtaaataaag 1800 
caccttgatt gtgctgcttc caggctgtag 1860 
catgtggcct gcagagccca aatatctact 1920 
ccctgctatg gagcagaggt tatcaaacta 1980 
tgtttttgta aatagagttt tatcaaacca 2040 
atggctgctt tagacaactg tgacagacta 2100 
tcctttaaca gaaagtttgc caacctctgc 2160 
ttaattttat gagggcaata caactgtcac 2220 
actcatttat caatagaggt gaaaggaaat 2280 
aataaaagga gaaatgat 2328 



<210> 21 

<211> 4209 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 899943.1. dec 
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<220> 

<221> unsure 

<222> 27, 3938 

<223> a, t, c, g, or other 



<400> 21 

gctgtttcgg tcgggagtgg gtggganaga agccggggca ggggaggagc cgccggagct 60 
gtcggagccg gcccttggaa gaaaatcctc gctgtgtcca ggctgaggcg gggggctaat 12 0 
gacagtgtga gctctagatg gtgtgagacc accccaaagc caagaaatgg ctacagccgt 180 
ggaaccagag gaccaggatc tttgggaaga agagggaatt ctgatggtga aactggaaga 240 
tgatttcacc tgtcggccag agtctgtctt acagagggat gacccggtgc tggaaacctc 3 00 
ccaccagaac ttccgacgct tccgctacca ggaggcagca agccctagag aagctctcat 3 60 
cagactccga gaactttgtc accagtggct gagaccagag aggcggacaa aggagcagat 420 
cctagagctg cttgtgctgg aacaatttct taccgtccta cctggagaac tacagagctg 480 
ggtgcggggc caacggccag aaagtggcga ggaggcagtg acgctggtgg agggtttgca 540 
gaaacaaccc aggagaccaa ggcggtgggt gactgtccat gttcacggcc aggaagtcct 600 
gtcagaggag acggtgcatt taggagtgga gcctgagtca cctaatgagc tgcaggatcc 660 
tgtgcaaagc tcgacccccg agcagtctcc tgaggaaacc acacagagcc cagatctggg 720 
ggcaccggca gagcagcgtc cacaccagga agaggagctc cagaccctgc aggagagcga 780 
ggtcccagtg cccgaggacc cagaccttcc tgcagagagg agctctggag actcagagat 84 0 
ggttgctctt cttactgctc tgtcacaggg actggtaacg ttcaaggatg tggccgtatg 9 00 
cttttcccag gaccagtgga gtgatctgga cccaacacag aaagagttct atggagaata 9 60 
tgtcttggaa gaaagactgt ggaattgttg tctctctgtc atttccaatc cccagacctg 1020 
atgagatctc ccaggttaga gaggaagagc cttgggtccc agatatccaa gagcctcagg 1080 
agactcaaga gccagaaatc ctgagtttta cctacacagg agataggagt aaagatgagg 1140 
aagagtgtct ggagcaggaa gatctgagtt tggaggatat acacaggcct gttttgggag 1200 
aaccagaaat tcaccagact ccagattggg aaatagtctt tgaggacaat ccaggtagac 1260 
ttaatgaaag aagatttggt actaatattt ctcaagtgaa tagttttgtg aaccttcggg 1320 
gaaactacac ccgtccaccc cctgttaggg aggcatcatg actgttctgt gtgtggaaag 1380 
agcttcactt gtaactccca ccttgttaga cacctgagga ctcacacagg agagaaaccc 1440 
tataaatgta tggaatgtgg aaaaagttac acacgaagct cacatcttgc cagggcacca 1500 
aaaggttcac aaggatgaac gcgccttaca aatatcccct aaaccggaag aatttggaag 1560 
agacctcccc tgtgacacag gctgagagaa ctccatcagt ggagaaaccc tatagatgtg 1620 
atgattgcgg aaagcacttc cgctggactt cagaccttgt cagacatcag aggacacata 1680 
ctggagaaaa acccttcttt tgtactattt gtggcaaaag cttcagccag aaatctgtgt 1740 
taacaacaca ccaaagaatc cacctgggag gcaaacccta cttgtgtgga gagtgtggtg 1800 
aggacttcag tgaacacagg cggtacctgg cgcaccggaa gacgcacgct gctgaggaac 1860 
tctacctctg cagcgagtgc gggcgctgct tcacccacag cgcagcgttc gccaagcact 1920 
tgagaggaca cgcctcagtg aggccctgcc gatgcaacga atgtgggaag agcttcagtc 1980 
gcagggacca cctcgtcagg catcagagaa cacacactgg ggagaaacca ttcacgtgcc 2040 
ctacctgtgg aaaaagcttc agcagaggat atcacttaat taggcatcag aggacccact 2100 
cagaaaagac ctcctagcta ggtccccatg tgaggagatc tgctttcagc cc tcacctaa 2160 
gggaggtgag gaagaggaaa agccctcttg tcagcctggg aagacctttt cgagggagtc 2220 
tccctgacct gctcagatct gacattacct cttcctgcaa ctaaacacga gcctgggcag 2280 
aacctctcag ccttcctcta cgccttgagg ggatgtttca tccaaagtac aacctgaatt 2340 
gaggcttctc cttcactgga gtgcacctgc ctctacctca tgggtataaa gtaggagaac 2400 
taagagactt aagaggtcgt ggttcctata tcgtccaaaa aataggctgt tacatatcct 2460 
aaagactgct caacagcttc aagttgaaag tggccaagga cagcccctta ggtttgggaa 2520 
gggacgagcc tgaaggattc tgtctttact ggggtcaaat cttaaagcac acagctctgg 2580 
actcaagaca ggaggtttgc gtcctgatgg ctttgccaca cattcacagg ataactgcat 2640 
agatccctcg ctgtctgatt cacttcttac catgcacttt cctttgatgc tgaggagaaa 2700 
tggaagtggg cgaaaaatct caaggctgct tcatgtggac cttgtcaagc tgctccctcc 2760 
cccagcgtca aattgttatc aggtgccaaa cactgctaga aaggagggcc tagtcagaag 2820 
cctctttcca tacgagtttt ggttttgttt ttaatatttt tttctattaa aatactcatg 2880 
catttaacct tcccgttatt caaccagtct cttggttgca tccctagcac ttctactaca 2940 
agtgagatgg tagtgtttga gtgcttattg agtaaagcat aattcggtca taatgaaatc 3 000 
gttcacattc cctcatatgc acaagcccac caaccccttc acacccccct tcacaggggt 3 060 
cgtatgagta aggggatttg gaaactgtca acttacaaag gcactataac aattacagaa 3120 
tcatgattgc catgggccac tttatttaca tgaagacaac tggagaacga ctaagaccaa 3180 
attatggaaa ataagaaaaa gctgttgctg gcaagaccat caagactgtt ctgacaccct 3240 
gtccccatca tccctgactg agtactctga catcacggaa agtgttgaac ctgggaccct 33 00 
gaggaattca ccaggagtaa atggctttca tgtatttgtg ttgtttgctt tttcttacgt 33 60 
gattttatgt tcatagagct agaaagtagc atctcatgat ggcccaacaa tctctgttgc 3420 
cagttaaagg ttccttggag atgaggctga ataattatga acctcacctt ctctgattgt 3480 
gggagtggca agaactgggg agacgtcctc cataagtgga gcacagggta tggggttaaa 3 540 
gcatgacagg gagagtcttc tgtgcctggt ttcttctcct ctatctcata atgcattatg 3 600 
ggcccgagga ataggggagg gttaataaga ctccaaccct aatggcccaa cagggaaatt 3 660 
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ctcattttgg tcgatgatat tctgatggac tggtttggtc ttaataccag tcaaccgttg 3720 
tccttctgga aatatacata tatgaaataa ataaaggtaa cacttgcagc caagttccct 3780 
ggtttctggg acttcccatc ttacccattc cttttccagg gcttccagtg tcctgatact 3840 
tctgagggtg gttcatactc caaattagat tctggggagt tacagagtaa ttttttcctt 3 9 00 
gagggaaagg gaagggttgg gggatggatt taggcagnag ttccggtgga aacattattt 3 9 60 
gcactctgag ataagatcca agcctggagt ttgcagaaga tactgtccta ataagcaggc 4020 
atttctaaac caagtatcta agcctaagca cagcttgtcc tgggtgaaat gtctgccaca 4080 
aaagatagtt tctcctagct cagacttaac catttataaa ggttggtaaa atactggcag 4140 
tgacaacaaa ttgacttttt aattttctta tttgcattat tccaataaat gaaaatctgt 4200 
cagagttcg 4209 

<210> 22 

<211> 710 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 443 551.1. dec 

<400> 22 

gtttctttca tatatcagaa cttcatattc attggagaga ccatacagga gagaaggtct 60 
ataaatgtga tgattgtggt aaggatttta gcactacaac aaaacttaat agacataaga 120 
aaatccacac agtggagaag ccctataaat gttacgagtg tggcaaagct tcaattggag 180 
ctcccatctt caaattcata tgagagttca tacaggtgag aaaccgtatg tctgtagtga 240 
gtgtggaagg ggctttagta atagttcaaa cctttgcatg catcagagag tccacaccgg 3 00 
agagaagccc tttaaatgtg aagagtgtgg gaaggccttc aggcacacct ccagcctctg 3 60 
catgcatcaa agagtccaca caggagagaa accctataaa tgttatgagt gtgggaaggc 420 
gttcagtcag agttcgagcc tctgcatcca ccagagagtc cacactggag agaaacccta 480 
tagatgttgt ggatgtggga aggccttcag tcagagttcg agcctgtgca tccaccagag 540 
agtccacaca ggagagaaac ctttcaaatg tgatgagtgc ggaaaggcct tcagtcagag 600 
tacgagcctc tgcatccacc agagagtcca cacaaaggag agaaaccatc tcaaaatatc 660 
agttatataa aacgttttgc taagagttta aaatcttaaa acccataagt 710 

<210> 23 
<211> 1047 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 897957.1. dec 
<220> 

<221> unsure 
<222> 64, 71 

<223> a, t, c, g, or other 
<400> 23 

aagaatgtga tgaagctttc agattcaaat caacccttga tagtcatgga attcatacta 60 
gagnggaacc nttagcagtg taatgaacgt ggcaaaggtt ttaaatcaaa aagcaaagct 120 
tgcacatcat catagaattc atactggaga taaacgttac aaatgtgaag catgtgacaa 180 
agtttacagt cgcaaatcaa gcctcgaaag acaggagaat tcatactgga gagaaagctt 240 
acatatgtga agaatgtcac caagttttca gtcacactca aaccttgaaa gacacagcag 3 00 
aattcctact ggagagatag cataaaaatg taagagtttg tgacaaggct ttcaggcata 3 60 
attcgcacct ggcacaacat cctagaattc acattggaga gaaagcttac aagtataatg 420 
aatgtgacag gtctttagtg ggcagtcaac acttgtttac catcaggcaa tccatggtgt 480 
agggaaactt tacttatgta atgattgtca caaagtcttc agttacacta caaccattgc 540 
gaatcattgg agaatccata atgaagagag atcatactag tttaataaat ttggcaaatt 600 
tttcagacat tgttcataac ttgcagttca tcggcgaact cgtactggag agaaacctta 660 
caaatatcat gactgtggca aggtcttcag tcaagcttca tcctatgcaa aacataggag 720 
gaattcatac aggagagaaa cctcacaagt gtgatgattg tggcaaagtc ttgacttcac 780 
gttcacacct cattagacat cagagaatcc atactggaca taaatcttac aaatgtctta 840 
agtgcaacaa ggtcttcagt ctgtgggcac tccatgcaga acatcagaaa attcattttt 900 
gagataactg ttccaaatac agtgactata gaagatcata aagctttaat tgacattaga 960 
gccaaatagg cattgacttg agattgagtt gacttaacct tgagtttaag aattaattta 1020 
cattaaagtg tttatgttaa gaagaaa 1047 
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<210> 24 

<211> 676 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 900911.1. dec 
<400> 24 

ttccgttttc gcgtggttct tttgcaagct ctggattctc tggagtttga atgcatccag 60 

tattggaacc ccaccaagta ggactgatca ggtcttacaa ttctaaaacc atgacctgtt 120 

ttcaggaatt agtgacattc agggatgtgg ccatagactt ctctcggcag gagtgggaat 180 

acctggaccc taatcagagg gacttataca gggatgtgat gttggagaac tatagaaacc 240 

tggtatcact gggaggacat tccatttcta aaccagttgt ggttgattta ctggagcgag 3 00 

gaaaagagcc ctggatgatt ttgagggaag aaacacagtt cacagatttg gatttacagt 3 60 

gtgagataat cagctacata gaagtaccca cttatgaaac agatatatcc tctacacaac 420 

ttcagagcat atataagaga gagaaactct atgaatgtaa gaaatgtcag aagaaattta 480 

gtagtggtta tcaacttatt ctacatcaca ggtttcatgt cattgagaga ccctatgaat 540 

gcaaagagtg tgggaagaac tttcgtagtg gctatcaact tactctacat caaagatttc 600 

atactggtga gaaaccctat gaatgtacag aatgtgggaa gaactttaga agtggttatc 660 

agctgactgt gcatca 676 

<210> 25 

<211> 631 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 999296.1. dec 
<400> 25 

atgagtttgg gaaaccattt taccattgtg catcctatgt tgtaaccccc tttaagtgta 60 

atcagtgtgg acaagacttc agtcataaat ttgacctcat tagacatgag cgaattcatg 120 

ctggagagaa accttacgaa tgtaaagaat gtggaaaagc cttcagtagg aaggaaaatc 180 

ttattacaca tcagaaaatt catactgggg aaaaaccgta taagtgtaat gaatgtggaa 240 

aagctttcat tcagatgtca aaccttatta gacaccacag aat tea tact ggggagaaac 3 00 

ettatgeatg taaggattgt tggaaagect tcagtcagaa atcaaatctc attgaacatg 3 60 

agegaattea cactggagag aaaccctatg aatgtaatga atgtgggaaa tccttcagcc 420 

agaagcaaaa tcttattgag catcagaaaa ttcatactgg ggagaaacct tatgcatgta 480 

atgaatgtgg aaaagectte agtcaaagca tgcatcttat tgtacatcag agaagecata 540 

ctggaaaaaa ccctatgagt gtagtcaatg tggaaaagee ttagtaagag ctcaactctt 600 

accctacatc agegaaatea cactggagaa a 631 

<210> 26 

<211> 577 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 442286.1. dec 
<400> 26 

gcgatgtggc gettgegate tctcgccgcc ggcagaggct cctcgaagag egacaegggg 60 

ctgaccaggc acggtggtca aagcegcaga gggagagegg gaeeggtegt gagategtet 120 

ggggagaagg gcggaggcaa agecgaggag gtgcgggttg tggtccattc tggaggaege 180 

tgatcgaatg ccccaaactt cccggaatgt gtgtggaccc ttctagcctg aggagtcctg 240 

caggtgtgaa gctccacacc tgcctccata gcactttgcc tgtccctaag agggctcatc 300 

ggagaagaaa gaatggctgt cagccacctg ccaaccatgg tccaggaatc ggtgaccttc 360 

aaggatgtgg ctatactgtt cacccaggaa gagtgggggc agctgagccc cgcccagagg 420 

gccctgtaca gggacgtgat gctggagaac tacagcaacc tggtctcact gggactctta 480 

ggacccaaac cagataegtt ttcccagcta gaaaaaaggg aagtgtggat gecagaggae 540 

acccctggag gcttctgtct tgatggagtc tcactct 577 

<210> 27 
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<211> 1349 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 901978.1 -dec 
<220> 

<221> unsure 

<222> 1267, 1324, 1329, 1341 
<223> a, t, c, g, or other 



<400> 27 

gcagcccgag 

gcgggataaa 

tggcggctgt 

aaaaaggaca 

gcttggtggc 

atccttggca 

cagtagggtg 

tggtgacaga 

aagccaaatg 

aaac ggaaag 

tcagcacgaa 

actgtagtca 

tccataatgg 

catcccttag 

gttttcacgt 

agaat caeca 

ggcacaatag 

tcaggaagag 

aatgtaacga 

tacatacegg 

cagctgtgaa 

gtggganatc 

tggnttgant 



agggaggagg 
ggaggaacgt 
catcctgccc 
cacacagggc 
ctttgaggat 
gaaaaaactc 
tcgtgttaat 
ggaaagagga 
gttaactcct 
aagtcatcgt 
atctaaccta 
atgtgggaag 
ggagaaaccc 
actgeatttg 
tttccgcacc 
tgaatgtaat 
cattcataca 
ctcctatctg 
gtgtgggaaa 
agagaaaccc 
gaaacactta 
cttcacaagt 
tactgeaggg 



ccacggagac 
cctgcttccc 
tegactgetg 
ggagagctgg 
gtggcagtga 
tacagagatg 
aaacccagtc 
attctaccaa 
aagaagaatg 
ggagtgaaac 
actcagcaca 
tccttcagta 
tatgaatgea 
agaattcaca 
agttgtaacc 
cagtgtggaa 
ggggagaaac 
acacagcacg 
tccttcagca 
tacgagtgea 
agaactcaca 
aactcctatc 
ncttctgga 



ttggagcatt 
ggctgccctg 
ctccgtcttc 
ttaatgagct 
acttcaccca 
tgatgctgga 
tgatatccca 
gcacctgtcc 
ttttcagaaa 
tcaatgaatg 
agagaattca 
gcagatctta 
atcactgtgg 
ctggagaaaa 
teaaaageca 
aagctttcag 
cttatgaatg 
taagaactca 
gtagcttttc 
gtgactgtgg 
ctggagaaaa 
ttttctgtgc 



cccgtttctt 
ttgctgtcgg 
cctgttccca 
cctgacaagc 
ggaggagtgg 
aacctatagg 
gttggaacaa 
agatttggag 
agaacagtct 
taatcagtgt 
tactggagaa 
ccttactatt 
gaaagcattt 
accctatgaa 
caagaggatt 
cacaaggtcc 
tcacgattgt 
tactggagaa 
tcttactgtg 
aaaagccttt 
accctatgaa 
acaagaggat 



ccagagcgct 60 
agtcacagga 120 
gcctctcagc 180 
tggctacggg 240 
getttgetgg 3 00 
aacctggctt 3 60 
gacaagaagg 420 
actctactta 480 
aaaggtgtaa 540 
tttaaagtct 600 
aaaccctatg 660 
cataagagaa 720 
agtgatccct 780 
tgtaaccagt 840 
cacaeggggg 900 
tctctcactg 960 
gggaaaacct 1020 
aaaccctatg 1080 
c ac aagagaa 1140 
aataatctct 1200 
tgtaatcatt 1260 
acataataga 1320 
1349 



<210> 28 

<211> 1696 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis c_f eature 

<223> Incyte ID No: 479346.1. dec 

<220> 

<221> unsure 

<222> 1089, 1237, 1283 

<223> a, t, c, g, or other 



<400> 28 

cgccataggc cagtgccggg gtttaagggc caggaaagga agcattcagg gaatttaggt 60 
gtagccagaa gaaaatcagg tcctggctcc ccagaagcaa gagagttcaa gtgaaggaag 120 
gaggaggttc ctggatgtgg atg tea teat ttctgggaac actcttaaat ggagactcag 180 
atttcttagc caaaatttag ggaggatcca gaagaaacca aagacgaagc atcccagttc 240 
ttgggtattt cctgaaacag aagaaaatga caaaggccca ggtaactttt ggtttgtttt 300 
attcttaact cgattccctg atggatttgt ggaagtctga agaaatcagt tcatttacat 3 60 
taaaagtata aattggaaaa aatgagctct ggggagaaac agacttagac tgtaccttta 420 
atccaggaca aagtgggatc acaaatactt tgttcacata gttcacatgt gaccagtgtt 480 
ggaatataat gtgegactet gccagaccct gttggctaca ttaagtgtca gcttgtgtcc 540 
aaaatctttg gccaccatca cacccagaga atcatggaat cactgaccct ggaggatgtg 600 
gctgtggact tcacctggga ggagtggcag ttcctgagcc ctgctcagaa ggacctgtac 660 
cgggatgtga tgttggagaa ctacagcaac cttgtgtcag tggggtatca ageeggcaaa 720 
cctgatgccc tcaccaagtt ggaacaagga gaaccactat ggacactaga agatgaaatc 780 
cacagtccag cccacccaga aattgagaaa gctgatgatc atetgeagea gcccttgcaa 840 
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aaccaaaaaa tactgaagag gacgggacaa 
tatttaggtt taaccaacca gagcagaaga 
ggagatggag cttttctcca tgataatcat 
gaaagtagaa aacccatcag caccaagtca 
atagagaang cccatgaatg cactgactgt 
actgagcata agagaattca tacaggaaag 
gccttctaca agaagtacag gctcactgaa 
ccacgggtgt agcttgtgtg ggnaaagcct 
agagagctca caaaggagag gaaaccatac 
aggaaatctg agcttactga acatcaaagg 
agcgaatgtg ggagagcttt ctccagaaaa 
acaggagaga agcctcatac atgcagtgaa 
ctcaacatac atcaacgaac tcacactgga 
aaggccttca gccagaagtc ttgccttgta 
ccctttgtat gtcctg 

<210> 29 

<211> 2459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 481750.1. dec 
<400> 29 

cattaagaat cctagcatta ttggccacag 
gtatatagtt aactatattg ggggcattgc 
tatctgggcc tggaaaaaat gtaccactta 
tgactcctca ggagcaaaaa acataacctg 
atttcttggg gcacacttga ttgaaaactg 
aagacagacc atcccagttg aatgctgtct 
aggaatccat aacactggag gatgtggctg 
tgggcgctgc tcagaaggac ctgtaccggg 
tggcagtggg gtatcaagcc agcaaaccgg 
aactgtggac aattgaagat ggaatccaca 
atcatgtgct ggagcgcttg cagagtgaaa 
aacatgatgc atttgaaaat attgttcatt 
atcatgatat atttgactta cgtggaaaaa 
agagcaaagg ctatgaaata aagaactctg 
ttcatgctaa ccatgaacga cttcatactg 
tcagcactaa gtcccaattc atcagtccca 
atcatgtgtg cagtgaatgt gggaaagcct 
aggtaatgca tacaggagag aaaccccaca 
gaaagttcat gcttactgaa catcagcgaa 
ctgaatgtgg caaagccttt ctcaagaaat 
ccggagagaa accctatata tgcagtgaat 
tcattgtaca ccagcgaatt catacaggtg 
aaggcttcat tcagaagacg tgtctcatag 
cctttgtgtg cagtgaatgt ggaaaatcct 
aaagaattca cacaggagag aaaccctttg 
caaagcaaaa gctcattgtc catcaaagga 
acgagtgtgg gaaagcgttt gtgtatatgt 
caagggagaa acaagaggca gccaaggtgg 
tacacaccag tgatgtcatg caggagaaaa 
cttctgtggc ccctcagaca tcattaaaca 
tccttgtggg acagccagtg gtcagatgtg 
aggacagaaa ccttgtgaat gcagtgaatg 
tattttatgt tacagaaaac ccataggaag 
gcaaaaattt gtagttcatt atgtggccga 
ctgagatagc ctgataaact catcctatta 
gatgttaacc taagcacata cacagcaatt 
aagcccaagt acttttaagt ggaggaaatg 
taattggtgt cgggagtgtt gcagtgactg 
agggctaata ctgtaatggt ggaatgtggg 
ccagtgccag tgtggaggat taattgaaaa 
tatgcagagg gatctgtgaa actagtcaat 



cgctatgaac acggaagaac tttgaaatca 9 00 
tacaacagaa aggagcctgc tgagtttaat 9 60 
gaacaaatgc ctacggaaat tgaattccct 1020 
caattcctta aacatcagca aacacacaac 1080 
gggaaagctt tcctcaagaa gtctcagctc 1140 
aaaccccacg tgtgtagctt gtgtgggaaa 1200 
cacgagngag ctcacagagg aggagaaacc 12 60 
tctacaagag gtacaggctc actgaacacg 1320 
gggtgcagtg aatgtgggaa agccttcccc 13 80 
attcacacgg gaattaagcc ccatcaatgc 1440 
tcactactcg ttgtacatca gcgaactcat 1500 
tgtggaaaag get teat tea gaagggcaat 15 60 
gagaaacctt atggatgcat tgactgtggc 1620 
gcacatcaga gatatcatac aggaaagact 1680 

1696 



gtataaaatc cgctggggct ccaaacaaag 60 
tttaagatga gtgectagea gcagtgacag 120 
cattgaattt gctcttcaac ccacaggttc 180 
aagagggagg aagtggattt ggggttcacc 240 
agacttctga agagaaggee agaagataca 3 00 
tccaagaaca gaagaaaatg atccaggccc 3 60 
tggacttcac ttgggaggag tggcaactcc 420 
atgtgatgtt ggagaactac agcaacctgg 480 
atgeactett caagttggaa caaggagaac 540 
gtggagcctg ttcagacata tggaaagttg 600 
gcctggtgaa cagaaggaaa ccatgtcatg 660 
gcagcaaaag tcagtttctg ttagggcaaa 720 
gtttgaaatc caatttaact ttagttaacc 780 
ttgagtttac tggaaatggg gactcctttc 840 
caattaaatt ccctgcaagt caaaaactca 9 00 
agcatcagaa aacacgaaaa ttagagaagc 9 60 
tcatcaagaa gtcttggcta actgatcacc 1020 
gatgtagtct atgtgagaaa gccttctcca 1080 
ctcatacagg agaaaaacct tatgaatgee 1140 
cacggctcaa catacatcag aaaacacata 1200 
gtggaaaagg cttcatccag aaaggaaatc 1260 
agaaacctta tatatgeaat gaatgtggaa 1320 
cacatcagag atttcacaca ggaaagaege 1380 
gttctcagaa atcaggtctc attaaacatc 1440 
aatgtagtga atgtgggaaa gectttagea 1500 
ctcatacagg agagagaccc tatggctgta 1560 
cgtgtctggt taagcataag agaatacaca 1620 
aaaatcctcc tgcagagagg cacagctcat 1680 
actctgetaa eggggegact acacaagtgc 1740 
tcagcggcct cctcgcaaac aggaaegtag 1800 
cagcctcagg agataacaga ggatttgeae 1860 
tggttgtgcc ttccgtgatc aattatgtct 1920 
aaaactcaga tctatgtgga aaagggttga 1980 
aaagcataca ctgagagaac atgtataagg 2040 
aaatgtatgc tgtgatacac aggcaaattt 2100 
gctcgactgt gtcaattaaa tgagtaaagg 2160 
aagtactgtg aatttttaaa acagatgaga 2220 
aaaagcttca taaggggcag ttggtgggaa 2280 
acacatatgt gtcaattcag tttccttaga 2340 
ccaggaccat aaactttcag aactttgtat 2400 
ttgtttcatt aaacaaatat tttaaaatt 2459 
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<210> 30 

<211> 469 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 900917. 2. dec 
<400> 30 

ccatctttac cggctgtggc tcagcatggt cagtgccgtt taacctttat tgtggtggag 60 
tccaccccgc tgtgggcctc agataaccct gtacaaaggg gaatggagat tgcctgtatc 120 
cacctagatt cataagctgc cctgaggcga tcttggcatc aagaaagaca gcattgaggc 180 
acatctcacc atcagcttca gaggatgtca gcctttgata tgtcccatgg gttttttccc 240 
agggaaccaa tctgtccttt tgaagaaaag acaaagatag gaacgatggt agaggaccac 3 00 
cggtcaaatt cttaccagga ttcagtgacg tttgatgatg tggctgtgga gttcacccca 3 60 
gaggagtggg ctttactgga cacaactcag aaatacctct acagagatgt gatgctggag 420 
aactacatga acctggcctc tgtggatttc tttttctgct taacttcag 469 

<210> 31 
<211> 682 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 999415.1. dec 

<220> 

<221> unsure 
<222> 382 

<223> a, t, c, g, or other 
<400> 31 

ccaccatctt ttgggtccgg gaggattgag tcatcgggct ccaatgcgtg gggatgttta 60 
ccgccgttta tccgggatag agactccatc gtgctgacag catcctttta ttcaccgcct 120 
ccgaatttgc aaagaggagg aaggagggac ttcttggctt ctcccagcat agccccagtt 180 
atgccatctc agaactatga ccttccccag aagaagcagg agaaaatgac caagtttcag 240 
gaggctgtga cattcaagga tgtggctgtg gtcttctcca gggaggaact gcgactgctc 3 00 
gatcttaccc agaggaagct gtaccgagat gtcatggtgg agaacttcaa gaacctggtt 3 60 
gcagtgggca gcaagcatca anaattaagg atggaaacac tccaaaaatt tgcattaaaa 420 
tacctttcaa atcaagagct gtcctgctgg caaatctgga aacaggttgc aagtgaatta 480 
accaggtgtc ttcaggggaa gagttcccag ttattacaag gtgactctat tcaggtttct 540 
gaaaatgaga acaatataat gaaccctaaa ggagatagct ctatttatat tgaaaatcaa 600 
gagtttccat tttggagaac ccagcattct tgcgggaata catatctgag tgagtcacag 660 
attcagagta gaggtaagca aa 682 

<210> 32 

<211> 996 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis c_f eature 

<223> Incyte ID No: 900680. 2. dec 
<400> 32 

ggttggcttc cgggatctgg cgcggcgttt tcctctggct cctgcgaggg cttggtttag 60 
ggcttcagct ctctgcgttc tcggctccgg gaggcctcgg tgattcagcc acagcctctg 120 
cctcccgttg ctctgtgacc tgagggtatt ggacaatttg tagctaagac tcccggatac 180 
cctgaagtcg ggaaatggaa ctcgtaacat tcagggatgt ggccatagaa ttctcccctg 240 
aagagtggaa atgtctggac cctgcccagc agaatttgta tagagatgtg atgttggaga 3 00 
actacaggaa cctggtctcc ctgggttttg tgatctctaa cccagacctg gtcacctgtc 3 60 
tggagcaaat aaaagagc cc tgcaatttga agatacatga gacagcagcc aaacccccag 420 
ctatatgttc tcctttcagc caagaccttt caccagtgca ggggatagaa gattcattcc 480 
acaaacttat actgaaaaga tacgagaaat gtggacatga gaatttacaa ttaagaaaag 540 
gctgtaaacg tgtgaatgag tgtaaggtgc agaaaggagt taataatgga gtttaccagt 600 
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gcttgtcaac tacccagagc aaaatatttc aatgtaatac atgtgttaaa gtttttagta 660 
aattttcaaa ttcaaacaaa cataagataa gacatactgg agagaaaccc tttaaatgta 720 
cagaatgtgg cagatcgttt tacatgtcac acctaactca acatacagga attcatgctg 78 0 
gagagaaacc ctacaaatgt gaaaaatgtg gcaaagcctt taataggtcc acatcactta 840 
gtaaacataa gagaattcat actggagaga aaccctacac atgtgaagaa tgtggcaaag 900 
cctttagacg gtccacagtt ctgaacgaac ataagaaaat tcatactgga gagaaaccct 9 60 
acaaatgtga agaatgtggc aaagccttta caaggt 9 96 

<210> 33 

<211> 2098 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 902791 . 3 . dec 

<400> 33 

ggctgacgcg gcggctctat ctcccgtaac tgtgacacgg gtgcacgcaa gcgtcattgg 60 
gggtgatggg ggccgtgctc ggtgcgcttc tgcaccggtg acgcaaccgc tgtgtctccg 120 
ccagtccgcg caggccagca tccttcagaa aaagcatccc cgaggaggaa gacgaatcgt 180 
taaacatctt aggtcagctc tagcctctcg gaatttgtct tcttcagtgg aaaccccgag 240 
aagactgatc agttcttcag ttctaaaaca atggcccagg gtttggtgac gttcgccgac 300 
gtagccatag acttttctca ggaggagtgg gcctgtctga actctgctca gagggacctg 3 60 
tactgggacg tgatgctgga gaactacagt aacttggtct cactggattt ggagtcagca 420 
tatgaaaata agagtttacc tacagaaaaa aacattcatg aaataagggc ttccaaaagg 480 
aattcagata gaagaagtaa atcccttggc cgtaactgga tatgtgaagg tacgcttgaa 540 
agaccacagc gctccagagg gaggtatgtc aatcagatga tcatcaatta tgtcaaaaga 600 
cctgctacta gagaaggcac cccttcctag aacacatcag agacatcata aggagaattc 660 
ctttgaatgt aaggactgtg ggaaggcctt tagtcgtggc tatcaactta gtcaacatca 720 
gaaaatccat actggtgaga aaccttatga atgtaaagaa tgtaagaagg ccttccgttg 780 
gggcaatcag cttactcaac atcaaaaaat tcatactggg gagaagccct acgaatgtaa 840 
agactgtggg aaggcttttc gatggggctc aagcctcgtt attcataaga ggattcatac 900 
tgggtgaaaa accctatgaa tgtaaagact gtggaaaggc ctttcggcgt ggtgatgagc 960 
tcactcagca ccagagattc cacactgggg agaaagacta cgaatgcaaa gactgtggga 1020 
agacctttag ccgtgtgtat aaacttattc agcacaagag aattcatagt ggggagaagc 1080 
cttacgagtg taaagactgt gggaaggctt ttatttgtgg ttcaagcctc attcagcata 1140 
aaagaattca cacaggtgag aaaccctatg aatgtcaaga atgtgggaag gcctttactc 1200 
gagtcaatta ccttactcag catcagaaga tccacaccgg tgagaagcct cacgaatgta 12 60 
aggagtgtgg gaaggccttt cgctggggtt cgagcctcgt taagcacgag aggatacata 1320 
cgggcgagaa gccgtacaag tgcacagaat gtgggaaggc cttcaattgt ggctatcacc 1380 
tcactcagca cgagagaatc cacacaggcg aaaccccgta taaatgtaag gagtgtggga 1440 
aggctttcat ttatggatcg agcctcgtga aacatgagag aattcatacc ggggtgaaac 1500 
cctatgggtg tacagaatgt gggaagagct ttagtcacgg ccatcagctt acacaacatc 1560 
agaaaacgca cagtggggcg aaatcctacg aatgtaagga gtgcgggaag gcatgtaacc 1620 
acctaaacca tctccgagaa catcagagga tccacaacag ttgaagagcc ttttgaacgc 1680 
agtagcccgc tcgtatctat ggtttcgctt tccacagttt gttacctgca gtcaactgca 1740 
gttcaaaaat attaaatgga aaattccaga aataaagaat tttaagtctc aaatggtgtg 1800 
cccttctgag tagcgtgatg aaatctctcg ctgtccggct ccagccggcc ggggatgtga 1860 
gtcatccctt ggtccagcac atccacgctg tatacgccac ccaccctgct agtgacttag 1920 
tagccgtctt ggtgatcaga tcaactatcc cagcatcaca gtgcctgtgc ccaagcagtc 1980 
ctcactttgc ttaacagtgg ccccagagag caggagtagt gatgctggtg attcggatat 2040 
gccaaagaga agccacaaag tgcttccttt taaatgaaaa ggtgaaagtt ctcaactt 2098 

<210> 34 

<211> 1520 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 053826.1. dec 

<220> 

<221> unsure 

<222> 1479 

<223> a, t, c, g, or other 
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<400> 34 

gaattccccc cccccccccc 
cattctctgt tgttacctgt 
tgaaaatttc ccactgaaaa 
atggttcaca ttcattttag 
gttcctctta ggggtctgcc 
atctcctagg cttgagttca 
gctcccaaat gatgtctttg 
taactctacc tttctgcttt 
caaaatgata gcaatctctg 
aagtaccgtc gtcctactca 
tattgaagca gctctgaaag 
gcgctacatt tcgcagaatg 
caaagtgttc ccaccggcag 
atcggcagag gcttgggcgg 
atttttgggc caaaatctat 
agttggtcaa gccatggtat 
gcaaccccag atgtcctatg 
tttgggccac ttccaatcgg 
ggggatctgt gtggcgacgt 
ggattggaga agcaccatat 
ggggatcttg tactgacaat 
ttaaataagt ttaccttttc 
tatatatata tggagagaga 
ctcttagtat tcctttgtat 
atatttgaaa catcaatttc 
gtaatgatga catagttgat 

<210> 35 
<211> 1722 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 204932. 4. dec 

<220> 

<221> unsure 
<222> 45 

<223> a, t, c, g, or other 
<400> 35 

ggggccccta caagcggcca caaggatggc aggcttcgcg gactncgggc tgtcatcgtg 60 
gctcgtggaa caatgtcggc agctgggttt gaagcagccc acgcccgtgc agctcggctg 120 
catccccgcc atcctggagg aagctgtctg aggatcccta tggcatcttc tgcctcgtcc 180 
tgacacccac cagggagctg gcctaccaga tcgcagagca gttccgggtc ctggggaagc 240 
ctctagggct gaaagactgc atcatcgtcg gtggcatgga catggtggcc caggcgctgg 300 
agctctctcg gaaaccacac gtggtcatcg ccacgccggg gcgcctggca gatcacctgc 3 60 
gcagctccaa cacttttagt ataaagaaga tccgcttcct ggtgatggat gaggcagacc 420 
ggctgctgga acagggctgc actgacttca ccgtggacct ggaggccatc ctggcggctg 480 
tgccggcccg caggcagaca ctgctgttca gcgccacgct gaccgacaca ctccgggagc 540 
tgcagggtct ggccaccaac cagcccttct tctgggaagc acaggccccg gtgagcaccg 600 
tggagcagct ggaccagcgc tacctgctgg tggcctgaga aggtcaagga cgcctacctg 660 
gtccacctga tccagcgctt ccaggatgag cacgaggact ggtccattat catcttcacc 720 
aacacgtgca agacctgcca gattctgtgc atgatgctgc gcaaattcag cttccccacc 780 
gtggctctgc actccatgat gaagcagaaa gaacgctttg ccgccctagc caagttcaag 840 
tccagcatct accggatcct gatcgcaaca gacgtggcct cccggggcct ggacatccct 9 00 
acggtacagg tggtcatcaa ccacaacacc cccgggctcc ccaagatcta catccaccga 9 60 
gtcggccgga cggcccgtgg cagggcggca gggtcaggcc atcacgctgg tgacacagta 1020 
cgacatccac ctggtgcacg ccatcgagga gcagatcaag aagaagctgg aggagttctc 1080 
cgtggaagag gccgaggtgc tacagatcct cacacaggtc aacgtggtgc gaagagagtg 1140 
tgagatcaaa ctggaggcgg cccactttga cgaaaagaag gagatcaaca aacggaagca 1200 
gctgatcctg gaggggaagg accctgacct ggaggccaag cgcaaggctg agctgggcca 1260 
agatcaagca gaagaaccgg cgcttcaagg agaaggtgga ggagacgctg aagcgacaga 1320 
aggctggcag ggctggccac aaggggcgtc cacccaggac accgtctggg tcccactcag 1380 
gcccagtccc ctcccagggc ctggtctgag ccccacacgg ccatctgccc agtccttgac 1440 



tcacttggtg tgtctatatg 
gattcatttt ttcttcactc 
tatgcagtaa tatattttgt 
ggttagtggt catgctgttt 
tcatgacact aaaaaatgaa 
acatttgttt ggatttttga 
taaattcata ccctctggcc 
aaagcaaagt aaactcggtg 
ccgtcagcag tgcactcctg 
attccactga ctcatccccg 
cacaattaga ttcagcggat 
acatgatcgc cattcttgat 
caaatatgga atatatggtt 
ctacttgcat ttgggaccat 
ctgtacgcac tggaaggtag 
gatgaagtga aagattatgc 
agatgttttg gtcccatgtg 
ataggatgcg caattcatac 
gcagtttact tggtatgcaa 
aaagtagggg taccatgttc 
ctgtgttttc caggagttac 
ctccaggaaa tataatgatt 
attttgcaca tattatacat 
aaattagtgt ttgtctagca 
tattttctga actctaagnc 



tctggcagac attatcagca 60 
tccaggtgaa tttcaattgc 120 
ggttcagaca tttggggcaa 180 
atttttctct gctatacaaa 240 
tagagattct actgtaggtt 3 00 
agaaagtcaa atcaagcaat 3 60 
ctattttttt tcatagaccc 420 
gcctcttctt ctccacccct 480 
ttctcccttc tctgtgaagc 540 
ccaaccaata atttcactga 600 
atccccaaag ccaggcggaa 660 
tatcataatc aagttcgggg 720 
tgggatgaaa atcttgcaaa 780 
ggaccttctt acttactgag 840 
gaaatatcgc tctattctcc 900 
ttttccatat ccccaggatt 960 
cacacattat acgcagatgg 1020 
ttgccaaaac atgaatgttt 1080 
ctatgcccca aagggcaatt 1140 
atcttgtcct ccaagttatg 1200 
gtcaaactac ctgtactggt 1260 
tctgggaaca tgggcatgta 1320 
attttgtgct aatctttttc 1380 
tgtttgttta atcctttgaa 1440 
taaattaaga tattgtatat 1500 

1520 
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tcgtccatgg agctgagggt cggaggaacc ttccttgggg gcagcagccc ttcccggggg 1500 
cctacccagt gccccacagc agaacccgtg ggcgctcgtg ttgtgcgggc cctgctcctc 1560 
tgccccgaaa ccactggctg gccccttccc tgagccctgg ccaagattca ggctgcaggg 1620 
gaagaaagaa catgaccggg aggttgtgac cccaacccaa ggtcaccccc caggggtgcc 1680 
gcatacagga ggtgcttaat aaacgggtct tttgacttcc tc 1722 

<210> 36 

<211> 1622 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 400607 . 19 . dec 



<400> 36 

tggaccaggc 

ggtgaagtgc 

ccatcataag 

attctagtaa 

tcagtactga 

tgaatgcatc 

cagaaacaga 

atcctcagga 

tgatggctgc 

actggaaaaa 

tgcgttaaac 

cttaacaaac 

gcctgcccac 

cacgtgggga 

gggggcttca 

aagaaacatt 
atgctgttct 
actgggaaaa 
gtattccaac 
atttgcctag 
ctgtaaatgg 
aagacataac 
ttgagattca 
ttgaagaagg 
aaaactggaa 
ttctctttga 
ccaagtacat 
at 



ggccgtggcg 
catcttcggc 
atcgttaact 
tggcagcaca 
gattatgaac 
tacagaagaa 
agcaaatttc 
ttttacaggc 
caggaaggca 
gtatgcagac 
ctctacagtt 
gctgtcattg 
acaacccggt 
tcagaacatt 
ggcaatacct 
ggaccctggt 
agctgcagga 
tgagcaggga 
acagctgtat 
agatctttta 
tcatagtggt 
cgatcctgca 
tcaagaaatg 
tattaaaaga 
agaatactta 
aagatgtgtc 
ggaaaaccat 



cgggtggcgg 
taggtcgtca 
gaagacaata 
ggcaacagtt 
gttacagaaa 
actgaaatgg 
cctccagaat 
tgggtatatt 
tttgacagat 
cttgaaaagc 
tgtcaagctt 
atgctctaat 
cagaatgcag 
gccatgttct 
cttagtgttg 
gatcctgaga 
acagatttcc 
aacctgagag 
agtcatcatt 
actggtgaac 
gatgatggtc 
aagctaatta 
tttaattata 
ccttactttc 
gaatttgaaa 
atatcatgtg 
agcattgaag 



ctgctgtgct 
caggctccgg 
tgcaaaattc 
cagaggtagt 
tggaacagtc 
caagtgctgt 
atgaaaaatt 
tgcttcaata 
ttttcataca 
ggcacgacaa 
tgcagtgcaa 
gggtctgtgc 
ggactgtgcg 
atccaacctc 
acctttggat 
caaacaatac 
gttctgacag 
aagttacagc 
ttcagagatt 
agtttattca 
ctcctggtga 
cagaaataga 
atgagcatga 
atgtgaaacc 
ttgaaaatgg 
ccctctatga 
gagtgaggca 



gggctgtggg 
ctcatggcat 
tcacatggat 
ggtagaacat 
acctgatgac 
ggaccttcca 
ttggaaaact 
tgtagaacag 
ctatccgtat 
cattaaacca 
gcctctgtat 
cctatggtct 
ctttgaagat 
cacccaaatg 
acattatata 
aataagagga 
actgtgggaa 
tatatatgat 
taaagaacat 
gttgcgaagg 
tgatctacca 
aaacatgaga 
agttagtaaa 
tttggaaaag 
gactcatgaa 
ggagttttgg 
tgcttcagca 



gac cggaggc 6 0 
caagtggcat 120 
gaatacagaa 180 
cctactgatt 240 
tctcccaatg 3 00 
gtgacgctga 3 60 
gtagaaaata 420 
gagaatcact 480 
tgctatggtt 540 
tcagatgagg 600 
tacactgcag 660 
gtaacccaaa 720 
caagactctg 780 
gtttatcggc 840 
aacttcttaa 900 
acttttgagc 960 
atgtatataa 1020 
cgtattcttg 1080 
gtacagaata 1140 
gaattagctt 1200 
tcgggaattg 1260 
catagaatca 1320 
aggtggacat 1380 
gcacaactaa 1440 
cgagttgtgg 1500 
attaagtatg 1560 
gagcttgact 1620 
1622 



<210> 37 

<211> 619 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 444248. 7. dec 



<400> 37 

gctttctgag agtcatggac 
tgctggtggc agctcccaga 
gactggtgaa gccttcggag 
gcagtggtta ctactggggc 
ggagtatcta tcatagtggg 
tatcagtaga cacgtccaag 
acacggccgt gtattactgt 
actactgggg ccagggaacc 
tcttcccgct gagcctctgc 
tccagggctt cttcccccag 
gaccgccaga aattcccac 



ctcctgtgca agaacatgaa 
tgggtcctgt cccaggtgca 
accctgtccc tcacctgcgc 
tggatccggc agcccccagg 
agcacctact acaacccgtc 
agccagttgt ccctgaagct 
gcgactttct actatgatga 
ctggtcaccg cctcctcagc 
agcacccagc cagatgggaa 
gagccactca gtgtgacctg 



gcacctgtgg tttttcctcc 60 
gctgcaggag tcgggcccag 120 
tgtctctggt tactccatca 180 
gaaggggctg gagtggattg 240 
cctcaagagt cgactcgcca 3 00 
gagctctgtg accgccgcgg 3 60 
aagtagtggc catatccttg 420 
atccccgacc agccccaagg 480 
cgtggtcatc gcctgcctgg 540 
gagcgaaagg gacagggcgt 600 

619 
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<210> 38 

<211> 499 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 346599. 9. dec 



<400> 38 

ggacgtcctt 

ggcctctaga 

caggaagatg 

caaggtggag 

gcagagcggc 

gcagacactg 

gagggcgctg 

acaactgacc 

caagccccgg 



ccccaggagc 
aagagctggg 
aaggttctgt 
caagcggtgg 
cagcgctggg 
tctgagcagg 
atggacgaga 
ccggtggcgg 
ctgggcgcg 



cggtgagaag 
accctgggaa 
gggctgcgtt 
agacagagcc 
aactggcact 
tgcaggagga 
ccatgaagga 
aggagacgcg 



cgcagtcggg 
cccctggcct 
gctggtcaca 
ggagcccgag 
gggtcgcttt 
gctgctcagc 
gttgaaggcc 
ggcacggctg 



ggcacgggga 
ccagactggc 
ttcctggcag 
ctgcgccagc 
tgggattacc 
tcccaggtca 
tacaaatcgg 
tccaaggact 



tgagctcagg 
caatcacagg 
gatgccaggc 
agaccgagtg 
tgcgctgggt 
cccaggaact 
aactggagga 
gcaagcggcg 



60 

120 

180 

240 

300 

360 

420 

480 

499 



<210> 39 

<211> 1555 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 
<223> Incyte ID No: 



480344. 2, dec 



<400> 39 

cttccccggg 

ctggaagctt 

catggggccc 

cacagcatta 

ccagccaccc 

gaatcttact 

atcatatgta 

agtatattcc 

caccttacac 

caccttatac 

ggccatggag 

gtggatgaat 

gaccctcttt 

gaactcaggg 

cctccccaga 

ctgcttcgca 

gctatcagga 

ttgctctgct 

cattgctcct 

tcatttttgt 

cttgaataca 

tttgatgaac 

cattggacta 

ttttttgaat 

tacttttgta 

aatctattca 



aagagtctca 
ctggatccta 
ctctcagccc 
cttttaaact 
aaagtgtccg 
ggctacatct 
gtagacggtc 
aatgcatccc 
atcataaagc 
ctggagactc 
actgtgatct 
ggtcagagcc 
ctatttggtg 
agtgccagcc 
attttccctt 
aactctaacc 
caaaagctct 
cgtaactcag 
ccaggattag 
atttcaggaa 
agttcctgat 
aggctgatac 
aataacaaaa 
gttttatcct 
aacaaaaatt 
tgaatattca 



gcgcagaagg 
ggctcatctc 
ctccctgcac 
tctggaactt 
aggggaagga 
ggtacaaagg 
aaataattat 
tgctgatcca 
gaggtgatgg 
ccaagccctc 
taacctgtaa 
tccctatgac 
tcacaaagta 
gcagtgaccc 
cagtcacctc 
caccagcaca 
ttatccctca 
ccactggcga 
gaactttttg 
gactggcagg 
aacttcaaga 
cttcatgaaa 
ggataatgtt 
ccagatttat 
gaaacatttg 
tatgtttatg 



aggaaggaca 
cacagaggag 
agagcacatc 
gcctaccact 
tgttcttcta 
gcaaatcagg 
atatggaccg 
gaatgtcacc 
gactagagga 
catctccagc 
tcctgagact 
tcataggatg 
tactgcagga 
agtcaccctg 
ttactattca 
gtattcttgg 
gattactcca 
ggaaagctcc 
ctttcaataa 
agatttatgg 
tcataccact 
ttcaagacaa 
ttcataattt 
gaattttttt 
cttttgctct 
gtaataaagt 



gcacagctga 
aacatgcacg 
aaatggaagg 
gcccaagtca 
cttgtccaca 
gacctctacc 
gcatacagtg 
cgggaggacg 
gtaactggat 
agcaacttaa 
ccggacgcaa 
cagctgtctg 
ccctatgaat 
aatctcctcc 
ggagagaacc 
acaattaatg 
aagcataatg 
acatccttga 
tccaagtagc 
aaaagactat 
ggactaagaa 
agaagaaaag 
tttattggaa 
cttcagcaat 
ctgagtgccc 
tatttgcaca 



cagccgtgct 
cagcagagat 
ggctcctgct 
tgattgaagc 
atttgcccca 
attacattac 
gacgagaaac 
caggatccta 
atttcacctt 
accccaggga 
gctacctgtg 
aaaccaacag 
gtgaaatatg 
atggtccaga 
tcgacttgtc 
ggaagtttca 
ggctctatgc 
caatcagagt 
agccctgatg 
gaaaaggac t 
ctttcaaaat 
aactccattt 
aatgtgctga 
tggtaaagta 
cagaattggg 
agttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1555 



<210> 40 
<211> 1687 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4113 9 6 . 24 . dec 



<220> 
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<221> unsure 
<222> 1653 

<223> a, t, c, g, or other 
<400> 40 

cgcgctactt aaggtcctgc tgcgtgagcc 
ggtgctggcg aacggaggcc ggagtaagaa 
gctgagagat cctgacataa atccttgttt 
ggatgaaaat aactatgaca gggaaaggtg 
ccggagattc tggaattcta tcgtgatgca 
gcctacggca gcagaaagag atgaaatctt 
gtttgcatta aaagtgttta tataacttag 
ctgtaatatt taagactgta cacccctcac 
acagtgaagt caccccgtgt cctttttgct 
ctgttggctg tgttgtggca tgagtttgca 
gctacatgag aaattgagct tttccactgg 
taaagactga atggggctga gatgaaggca 
aagagtgtga ggcaagagaa gctggaacca 
cctagaatct caggtgggtg gagcagtggt 
aaggaatgtg tggtgggcct cagatcccag 
tcacttatct ttgtttaaat aaggttagta 
ataaatgaca taaaaaagaa ctggttgtct 
ctgtgattaa aatccttaaa tattgggttg 
ccacagtaat gggaagaaag acaaatggat 
atgagttggg cagggaaaag ggtcagggtt 
tagaacttca ctttatgata taaacacaat 
ctgtgaagca gccctcctta tgcccactgc 
gttttatgaa gacatactat tattgaatga 
tgtcccttct cttcatctgg tcttgggtca 
aactgtctgt ctgagtctaa aaccaagctc 
aaatgattca ttttttattc aaatataact 
aaacctgagc ttggttttag actcagacag 
tggtcttggg tcaagaacat tgttttaatg 
gatgaag 

<210> 41 

<211> 3334 

<212> DNA 

<213> Homo sapiens 



attgacgtgt ttggagctgg agacggcctg 60 
gactgttaga atgccctcgg taacacagag 12 0 
gtcggaatct gatgcttcca ccagatgtct 180 
ttccacttac ttcttgaggt acaaaaactg 240 
gagaagaaag aacggagtga agccatttat 3 00 
gagagcagtg ggaaatatgc cctattgaat 3 60 
aagcagatga atatttctaa taaatgattg 420 
ccagacagac cttaagttct tcaagtggag 480 
tgctctcagt gccatgccga tggtatgttg 540 
tgactttctg gaggcatgga gttaggtaag 600 
gttttgaaag aagagtatga tgtgaacaag 660 
atgtttccaa ggaaaggaaa tgttatgagc 72 0 
cattcagaga gtatcctgta gattgctcca 780 
gggagaagac tggaaaggta agttgaaggt 840 
gctcattcct caaatcactt cttacttccc 900 
cattcactag gggcaaatgg gtttttctaa 960 
ttctttaaag aaagtcttag ttacaggatt 1020 
ccttctgcag ctgtagtatc agtttttgaa 1080 
tcccttagaa gtaagatttc tatttgcagg 1140 
catcaggtga actcaacact gggatgagac 1200 
acgattattc aatgtggtga ctggggtaga 1260 
cttttagaat cgtttgtttt attcatggta 1320 
aatttaatgt gtacttgaaa acattgcttt 13 80 
agaacattgt tttaatggct gccgacaatg 1440 
aggtttctaa gccacatgac cttgattgtt 1500 
acatttaaca atcaaggtca tgtggcttag 15 60 
acagttcatt gtcggcagcc attaaaacaa 162 0 
gcngccgaca atgttcttga cccaagacca 1680 

1687 



<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 3 0281 9. 4. dec 

<220> 

<221> unsure 

<222> 122, 163, 170, 177 
<223> a, t, c, g, or other 



<400> 41 

atttctagaa tctgttgggg ttcctgagag agagagagca cctgccactg gcattgaaaa 60 
agatgtcctg gcgtccgcaa taccgtagct ccaagttccg gaatgtctac gggaaggtgg 120 
cnaaccggga gcactgcttc gatgggatcc ccatcaccaa gantgtgcan gacaacnatt 180 
ctgtgccgtc aacacccgct tcctggccat cgtcaccgag agcgcagtgg ggcggctcct 240 
tcctcgtcat ccccctggag cagacaggca ggattgaacc caactacccc aaggtctgcg 3 00 
gccaccaggg caatgtgctg gatatcaaat ggaacccctt catcgacaac atcattgcct 3 60 
cgtgctcgga ggacacgtcg gtgcggatct gggagatccc cgagggcggg ctgaagcgga 420 
acatgacgga ggcgctcctg gagctgcacg ggcacagccg gcgtgtgggg ctggtcgagt 480 
ggcaccccac caccaacaac atcctgttca gcgctggcta cgactacaag gtcctcatct 540 
ggaacctgga tgtgggtgag ccggtgaaga tgattgactg ccacacggga tgtgactccc 600 
tcttgcattg tcctttcaaa cacggacggc agccctgctc accaccacgt caaggacaag 660 
aagctgcgtg ttgattgagc cccgctctgg cccggttggt tcctggccag gaaggccaac 720 
tgcaaaacca cagagtaacc gggtggtgtt cctggggaac atgaagcggc tcctcacgac 780 
aggggtctcc aggtggaaca caagacagat tgccctctgg gaccaggagg acctctccat 840 
gcccctgatc gaagaggaaa ttcatgggct ctctggcctc ctgttcccct tctatgatgc 900 
tgacacccac atgctctacc tggctggaaa gggtgatgga aacatccggt actacgagat 960 
cagcactgag aagccctacc tgagttacct catggagttc cgctccccag ccccgcagaa 1020 
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aggcctaggg gtcatgccca agcacgggct ggatgtgtca gcctgcgagg tgttccgctt 1080 
ctacaagctg gtgactctca agggcctgat tgagcccatc tccatgatcg tgccccggag 1140 
gtcagattcc taccaggaag acatttaccc aatgacacca ggcacggagc cagcactgac 1200 
cccggatgaa tggctgggag gcatcaaccg agatcccgtg ctgatgtctt tgaaagaagg 1260 
ctataagaag tcctcaaaaa tggtatttaa ggctcccatc aaagaaaaga agagtgttgt 1320 
ggtcaacgga atagatttat tagaaaatgt cccacccagg acagagaatg agctccttcg 13 80 
aatgttcttc cggcagcagg atgagattcg acggttgaaa gaggagctgg cccagaagga 1440 
catccgcatt cggcagctcc agctggaact gaaaaacttg cgcaacagcc ccaagaactg 1500 
ttagctcccc agctgggctg ttttctaagc cgatctctcc gtcgtttcta ctcatccctt 1560 
aacttctccc ttaccagtga ccccagagac agagccagga caggagtggg ggccagcctg 1620 
aggacccccg cctaccacct cgagaactgg aagccaacct ctaacctcct gacctcatgc 1680 
taataaaagt ccccagcttc tggagacccc ctgccggcag cgccctttcc ctgccacccc 1740 
aggagccagg cttcccctca gctgggtgaa gactacagac tccctggggt tggcaggggc 1800 
tccatctcag tggaccagga agcaagaggg gaagcgggat cccagctaga cttagaactt 18 60 
ggacttttcc cctgtgaagg gggctgccag gacatctcag cactcccgcc tggagctctc 1920 
agcatcactg aaggtaccac agtgtaagtg ctggactgca ggctgcagtg atccctcttt 1980 
cgtcccaccc cctcttccct cagcagcccc ggaagcctgc ctcacccgac gaggacagcg 2040 
agcggcccgg ctcctttctg tctcttccct tcccgccctc ttgtcttcag ggaattcaga 2100 
ggattgctct ccaaggccat aatgacccct tgccttcccc atgattctct acaaagctct 2160 
tgcacaccct tttcccattc aatttgtgag ccaggcaggg tagggattag tgtccccctt 2220 
tgacaaatga cagaactgag ggttgcaatg gggaaatgac ttataaagtc acccagcagg 22 80 
tcaacaatgg gcccacgacc aagaccctgg gtgttcagac cccaaggcca gggcctttcc 2340 
cgctgcatca agatgccaat ccctttgtgg gcttcaccag tgcccaagtc tctatggaga 2400 
atgagaactg gaagccactg ctaccgtcta cccagcacca gtagtgccga tgtgccacac 2460 
tgcccagttg aggcccctca cgctctgtgc ccctagatcc ttcaggtccc caccctcagc 2520 
tgtcaccacc accctcccca ggggactcca tctgagatga ggcctcgtcc tcctggaagc 2580 
tgaggctgag aagggtggag cttggccctg gggaaggcag accagggtct gatggcttct 2640 
agggatgctc tgcgtgtgtc tcagcaccgc tatctcagcc actttcagcc ttatgcacgt 2700 
agaatgacca cagccactcg catccgtata gcactttaaa gtttctgcag tcctttgaca 2760 
cataggatct catcgagcct cacgtctact cccttctgca gatgaggaaa ccgagagaag 2820 
tggcccaagg tcacgcaact ctgagatgcc acatttcatt tgatcttgta cacattttct 2880 
tttattcctt cttttttcct cctttcattt cccactacgc acaaagagtt tataaacact 2940 
gttctcagaa gagtcacagt ttggggtgag atctggaaat caagaaatgg gtgtccactc 3 000 
ttttctttca ttagctagga tctactagat gcattatact ccatacctgc ttttcccatg 3060 
gccgccctac ggaaaatccc atccacagag gccagggcta cccaagcccc tccaggtgag 3120 
ctgggccttt cc tttatgaa cctccatcct cccagccagc tacagtaggg cctcctcacc 3180 
ccgtacccca cagctagaca gtgtcagcac tcatctcctc ctcccacatt tctggagctt 3240 
ttttttttcc ttccccattg acctttgtgg tcttctgtga ttatttatgc tgcctcccaa 3300 
ggatagaatt gaaataaaat gttttcaact tatc 3334 

<210> 42 

<211> 2248 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 238734. 2. dec 
<400> 42 

gcgatctgag tagccagcgt cgccggcgac cgcggagttc tgggctagtg ggaccccgcg 60 
cgggctggtt cgggatgagc gatggcatcg gtcaaggtgg ccgtgagggt ccggcccatg 12 0 
aatcgcaggg aaaaggactt ggaggccaag ttcattattc agatggagaa aagcaaaacg 180 
acaatcacaa acttaaagat accagaagga ggcactgggg actcaggaag agaacggacc 240 
aagaccttca cctatgactt ttctttttat tctgctgata caaaaagccc agattacgtt 300 
tcacaagaaa tggttttcaa aaccctcggc acagatgtcg tgaagtctgc atttgaaggt 360 
tataatgctt gtgtctttgc atatgggcaa actggatctg gaaagtcata cactatgatg 420 
ggaaattctg gagattctgg cttaatacct cggatctgtg aaggactctt cagtcggata 480 
aatgaaacca ccagatggga tgaagcttct tttcgaactg aagtcagcta cttagagatt 540 
tataacgaac gtgtgagaga tctacttcgg cggaagtcat ctaaaacctt caatttgaga 600 
gtccgtgagc atcccaaaga aggcccttat gttgaggatt tatccaaaca tttagtacag 660 
aattatggtg acgtagaaga acttatggat gcgggcaata tcaaccggac caccgcagcg 720 
actgggatga acgacgtcag tagcaggtct catgccatct tcaccatcaa gttcactcag 780 
gctaaatttg attctgaaat gccatgtgaa accgtcagta agatccactt ggttgatctt 840 
gccggaagtg agcgtgcaga tgccaccgga gccaccgggg ttaggctaaa ggaaggggga 900 
aatattaaca agtccctcgt gactctgggg aacgtcattt ctgccttagc tgatttatct 960 
caggatgctg caaatactct tgcaaagaag aagcaagttt tcgtgcctta cagggattct 1020 
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gtgttgactt ggttgttaaa agatagcctt ggaggaaact ctaaaactat catgattgcc 1080 
accatttcac ctgctgatgt caattatgga gaaaccctaa gtactcttcg ctatgcaaat 1140 
agagccaaaa acatcatcaa caagcctacc attaatgagg atgccaacgt caaacttatc 1200 
cgtgagctgc gagctgaaat agccagactg aaaacgctgc ttgctcaagg gaatcagatt 1260 
gccctcttag actcccccac agctttaagt atggaggaaa aacttcagca gaatgaagca 1320 
agagttcaag aattgaccaa ggaatggaca aataagtgga atgaaaccca aaatattttg 1380 
aaagaacaaa ctctagccct caggaaagaa gggattggag ttgttttgga ttctgaactg 1440 
cctcatttga taggcatcga tgatgacctt ttgagtactg gaatcatctt atatcattta 1500 
aaggaaggtc agacatacgt tggtagagac gatgcttcca cggagcaaga tattgttctt 1560 
catggccttg acttggagag tgagcattgc atctttgaaa atatcggggg gacagtgact 1620 
ctgatacccc tgagtgggtc ccagtgctct gtgaatggtg ttcagatcgt ggaggccaca 1680 
catctaaatc aaggtgctgt gattctcttg ggaagaacca atatgtttcg ctttaaccat 1740 
ccaaaggaag ccgccaagct cagggagaag aggaagagtg gccttctgtc ctccttcagc 1800 
ttgtccatga ccgacctctc gaagtcccgt gagaacctgt ctgcagtcat gttgtataac 1860 
cccggacttg aatttgagag gcaacagcgt gaagaacttg aaaaattaga aagtaaaagg 1920 
aaactcatag aagaaatgga ggaaaagcag aaatcagaca aggctgaact ggagcggatg 1980 
cagcaggagg tggagaccca gcgcaaggag acagaaatcg tgcagctcca gattcgcaag 2040 
caggaggaga gcctcaaacg ccgcagcttc cacatcgaga acaagctaaa ggatttactt 2100 
gcggagaagg aaaaatttga agaggagagg ctgagggaac agcaggaaat cgagctgcag 2160 
aagaagagac aagaagaaga gacctttctc cgcgtccaag aagaactcca acgactcaaa 2220 
gaactcaaca acaacgagaa ggctgaga 2248 



<210> 43 

<211> 1723 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 399525. 3. dec 
<220> 

<221> unsure 
<222> 30 

<223> a, t, c, g, or other 



<400> 43 

ggacgcggcg ctggatttca agttggcggn 
tgcctctggc agtgacgagg acgaatgtcc 
gatctggata acccagtgct taaaacaaca 
ggagcagact tacgcactgt ggatccagag 
gcagcaggaa ttggcaaatt gtcaactgct 
ctccggagac tgacatcctc agttagttgt 
cggatgaaag cagaaaacag ccacaatgca 
gcttgttcag atggggatgt taatgctgtt 
aatgaacata cagaagaagg agaaagcctg 
gaattagcac aagtattgct tgctatgcat 
gacataactc ccctgatggc agcttccagt 
cttcttcatg atgctgatgt caactcccag 
gcatgtgctg gaggatttgt tgacattgtt 
gaagatcata atgaaaatgg acatactccc 
gaagttgcaa gagttctttt agatcatggt 
aaagaaagtg ctctaacact tgcttgctac 
cttgaagctg gtgcagatca agagcacaaa 
gcctgcatgg atggacatgt agaggtggca 
aacatgcctg cagattcatt tgaatctcca 
gaattggcag ctctacttat tgaaagggga 
tacactccct tgatggaagc tgcccgggaa 
agcacaagga gcaaatataa atgcccagac 
ggcttgctgt ggaggatttt ctgaagttgc 
agaacttggc tgctccacac ctctgatgga 
taaatatttg ctggcttctg gcgctaatgt 
cttaacctat gcttgtgaaa atggacatac 
ggctgattta gacaagcagg aggacatgaa 
gcatcagcca agaagttcca ggaccctgct 
aaagattatg aagtcccaat aaaaagagat 



tgccgtgctg aggaccgggg gtggaggtgg 60 
gaggttgaat catttatttt ggaccaagaa 120 
tcagagatat tcttatcaag tactgcagaa 180 
acacaggcac gactagaagc attgctagaa 240 
gatggtaaag cttttgcaga tcctgaggta 3 00 
gcactggatg aagctgctgc tgcactgaca 3 60 
ggacaagtgg acactcgcag tctagcagaa 420 
cgtaaattgc tagatgaagg cagaagtgta 480 
ctgtgtttgg cttgttcagc agggtattat 540 
gctaatgttg aagatcgagg gaataaagga 600 
ggaggttact tagatattgt gaaattatta 660 
tctgcaacag gaaacactgc gctaacttat 720 
aaagtgctcc ttaatgaagg tgcaaatata 780 
ttaatggaag cagccagtgc aggtcatgtg 840 
gcaggcatca acactcattc taatgaattc 900 
aaaggccatt tggatatggt tcgctttcta 960 
acagatgaga tgcacactgc cttaatggag 1020 
cgtttgcttt tggatagtgg tgctcaagtg 1080 
ttgacgctag ctgcctgtgg aggacatgtt 1140 
gcaaatcttg aagaagttaa tgatgaagga 1200 
ggacatgaag aaatggttgg cactactctt 1260 
agaagaaact caagaaactg ctcttacttt 1320 
agactttctt attaaggcag gggctgatat 1380 
ggcatctcag gagggacacc tggaattggt 1440 
gcatgctaca acagcaacag gagacacagc 1500 
ggatgttgca gatgttttac ttcaagcagg 1560 
gactattttg gagggcatag atccggccaa 1620 
gggtgacaaa ggaaatcctc ttcaattgaa 1680 
ttgtattgct ggt 1723 
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<210> 44 

<211> 1383 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 222795. 6. dec 

<220> 

<221> unsure 

<222> 559-560, 569-570, 1009-1012, 1022, 1056-1057 
<223> a, t, c, g, or other 



<400> 44 

cagacgcgcc gaccatgtcg gcagccaagg agaacccgtg caggaaattc caggccaaca 60 
tcttcaacaa gagcaagtgt cagaactgct tcaagccccg cgagtcgcat ctgctcaacg 120 
acgaggacct gacgcaggca aaacccattt atggcggttg gctgctcctg gctccagatg 180 
ggaccgactt tgacaaccca gtgcaccggt ctcggaaatg gcagcgacgg ttcttcatcc 240 
tttacgagca cggcctcttg cgctacgccc tggatgagat gcccacgacc cttcctcagg 3 00 
gcaccatcaa catgaaccag tgcacagatg tggtggatgg ggagggccgc acggggccag 360 
aagttctccc tgtgtattct gacgcctgag aaggagcatt tcatccgggc ggagaccaag 420 
gagatcgtca gtgggtggct ggagatgctc atggtctatc cccggaccaa caagcagaat 480 
cagaagaaga aacggaaagt ggagcccccc acaccacagg agcctgggcc tgccaaggtg 540 
gctgttacca gcagcagcnn gcagcagcnn gcagcagcaa catccccagt gctgagaaag 600 
tccccaccac caagtccaca ctctggcagg aagaaatgag gaccaaggac cagccagatg 660 
gcagcagcct gagtccagct cagagtccca gccagagcca gcctcctgct gccagctccc 720 
tgcgggaacc tggggctaga gagcaaagaa gaggagagcg ccatgagtag cgaccgcatg 780 
gactgtggcc gcaaagtccg ggtggagagc ggctacttct ctctggagaa gaccaaacag 840 
gacttgaagg ctgaagaaca gcagctgccc ccgccgctct cccctcccag ccccagcacc 900 
cccaaccaca ggtacagttg ccccgagtcc gccctcccag gagctcggtg gtcctcttcc 960 
ttccccaggt cctcgactcc cccaccaaat ggtctgcagc atctccctnn nntccttgga 1020 
cntggccagc cagccacctg cctacgtgga ctctgnnagc actagggggc gggggacaga 1080 
gagactgggg agcgcctttg cctttaaagc cagcaggcaa tatgccaccc tggccgacgt 1140 
ccctaaggcc atcaggatca gccaccgaga agccttccag gtggagagaa ggcggctgga 1200 
gcgtagaact cgggcccgga gccctggcag ggaggaggtg gcccgtctgt ttggcaacga 1260 
gcggaggagg tcccaggtga ttgaaaagtt tgaggccttg gacattgaga aggcagagca 1320 
catggagacc aatgcagtgg ggccctcacc atccagcgac acacgccagg gccgcagcga 13 80 
gaa 1383 

<210> 45 

<211> 2027 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 410628. 5. dec 



<220> 

<221> unsure 

<222> 20, 1971 

<223> a, t, c, g, or other 



<400> 45 

cagaaagtag aacgagagtn acaactgaaa 
gaggttaaag ctcaaagaat tcaacttcag 
ggattactaa ctcctgttgg agttggagag 
cagcaagtgg atcctgtttt acttaaagat 
tttgcgccaa tccagcctct ggcgatgcct 
cctccagggt ccatcgcaaa tcttacagaa 
ctgggccaag cacagttggc agggctgggg 
ttaatggtag ccagccctgc tcagaccctc 
gtcagtggaa gagcatctgc aatgtcaaac 
tcccaacctc agactccaac tccaagtcct 
taccctgcca ttgatattga tgcacagact 
gcctgtgctg gtggccacga ggaactggta 



actcagcagc agctaaaaaa gcagtatcta 60 
caacagcagc aacagtcttg ccaacacctg 120 
cagctttctg agggagacta tgcacggtta 180 
gaaccccagc agactgctgc tcagatgggt 240 
caagctttgc ctctggcggc aggtcccttg 3 00 
ctgcaaggag tgatagttgg acagccagta 3 60 
caaggaattc tgacagaaac acaacaaggg 420 
aatgacacgc tggatgacat catggcagca 480 
actcctaccc acagtattgc tgcatccatt 540 
atcatctctc cttcagccat gcttcctatc 600 
gagagtaatc atgacacggc gctaacactt 660 
caaacactgc tagagagagg agctagtata 720 
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gagcaccgag 
ggtgttgtgg 
aaggacacac 
ttagctcgag 
gctgcttctg 
aactctagaa 
catacagctg 
accaatcgga 
cttctgcttg 
atggaagctg 
gatgttaatg 
aaagggcatt 
aacaagaagg 
cagttactgg 
cttatggcag 
aatcagtttc 
atgctgaaga 
gctgaagcaa 
gaagaaagtc 
aagaaaaagg 
tttgaactcc 
ttgacagaac 



acaagaaagg 
aaatattgct 
cactctcctt 
gggcaaataa 
gtggctatgt 
ctggtagcaa 
ctgttaagct 
acactgccct 
atagaaaagc 
cctctggtgg 
cccctccagt 
acaaattctg 
ggaacactcc 
tgcaagcagg 
catttagaaa 
catcagattc 
agtgtcatct 
acaaaaacgc 
ggaggctggc 
aagaacaaag 
aagctgctca 
ctccaagtgc 



ttttactcca 
ggacaatggt 
ggcttgttct 
agagcacagg 
gaacatcatc 
attgggcatc 
cctgttagac 
tactttagcc 
aaatgttgaa 
atatgcggag 
tccctcctca 
tgagcttctt 
attgtggcta 
tgcagatgtg 
gggtcatgtg 
tgaatgtatg 
ttgtatggag 
cagcattttg 
tttggctgcg 
aaggaaac ta 
agaaaaagaa 
ccacaaccac 



ctcatcttgg 
gcagacattg 

gggggaagac 

aatgtttctg 
aaaatattac 
tctcctctga 
atgggctctg 
tgcttccaag 
cacagagcta 
gtgggccgag 
agagatacag 
attggcaggg 
gcagcaaatg 
gatgcagcag 
aaggtggtgc 
agatacatag 
tcaatagtac 
ttagaggagt 
aaaagagaaa 
gaagaaattg 
aagcttaaag 
tactaccata 



ctgccacagc 
aagcccagtc 
aggaggtggt 
attacacacc 
taaatgcagg 
tgttagcagc 
acataaatgc 
gaagaactga 
agactggtct 
ttcttttgga 
ctttaaccat 
gagctcatat 
gtggacacct 
ataaccgcaa 
gctacttagt 
caaccatcac 
aagccaaaga 
tagacttgga 
aaagaaaaga 
aagccaaaaa 
ttgaagatga 
ggtatat 



tggtcatgtt 
tgaaagaacc 
ggagctattg 
tctaagtctg 
agctgagatt 
tatgaatggg 
tcagatagaa 

agtggttagt 

cacaccacta 
taaaggtgct 
agcagcagat 
tgatgtacgt 
cgatgtggtt 
gataactcct 
caaagaagtc 
tgataaggag 
tagacaggct 
aaagttaagg 
gaagagaagg 
taaagagaac 
ncctgaagtc 



<210> 46 

<211> 6968 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 053 64 9. 6. dec 



<220> 

<221> unsure 
<222> 6955, 6964 
<223> a, t, c, g, 



or other 



<400> 46 

gcagcgggcg 

ggcgccgctc 

cgcggcgggc 

ccggtgatct 

tttacctctg 

aggtcctatg 

gaggagctgc 

tactggacag 

aagtggcaga 

aatgatgcca 

ataagaaatg 

aaggactctt 

cttcttatgt 

gagaagagga 

gcctttgttt 

gacgccctct 

acttttttct 

atggagaagc 

ctgttctctc 

cgcgccctag 

ccacctggtg 

gaaggctgct 

agctatgcca 

ggacctaact 

cgaccagaaa 

tcagtggaag 

acaagcattg 

atgaggaaaa 

ctggctgagg 



gcagctgcgg 
aggctcgggc 
cgggccggtg 
acaccatgga 
tttatccaac 
gccgggctcc 
tgcccaaaga 
agtgctgtga 
atgttctgaa 
agaaaaaaaa 
atttttgtaa 
ctcgaactca 
cttttaccaa 
ctgagccagg 
tcgagatgct 
tctctcagta 
gccagccagt 
gggaatcgat 
gccagtgcac 
agctgctgca 
ctctggactg 
gtgaccgggc 
cagaaaagtt 
cagaagatct 
cagccaacac 
cttttgaaaa 
ggaggattcg 
aggctccaca 
gctgggcact 



cgcaaccggc 
tccggctggg 
acgccggacg 
gaacaagccc 
gctctctcag 
gaagatgatt 
aggaaacaaa 
taccgaagtg 
ggc teat age 
caaaaccaac 
taaacagagt 
ggaatcctgg 
aaacctaggc 
ctggagcttt 
gcagcagttc 
tgtggtcaac 
gaagagctgg 
ecagaggega 
cttgctgctc 
caactgegtg 
ctgggtgttt 
acagatcgac 
aaagtccttg 
caacaggaca 
agctcagagt 
acactactta 
atetgetaag 
aaaggcagaa 
ccccatcaca 



teeggagctg 
cccggcgcgg 
cccatggacg 
atcgtcacct 
cagcttccaa 
cacctagagt 
gctctgctca 
tataaagcta 
tctgtggact 
atccttcccc 
gacaggtgtg 
aatgccttcc 
aagtttgagg 
tgtgaatatt 
gaggacgccc 
ttcggggccg 
aacggattga 
gaagccaccc 
ttcctgeaga 
caggaac tga 
ctgagctgtc 
tcaaacattg 
ggctatctat 
gttgaccttt 
ccttataaga 
gatttgtccc 
tttgttggaa 
atctatcttc 
cacacaagga 



cctggcgcgg 
cctcggggct 
cctctgagga 
gtgctggaga 
gagaaccaat 
ctaactttgt 
cgtttccctt 
cagtaaaaga 
ggttaatagt 
gaacctctat 
ttgtgctctc 
tgaccaaact 
atgacatgag 
tcatggttca 
tggtgcagta 
gggatggtgc 
tcctccgaaa 
tgttagatct 
ggccgtggga 
agctcttaga 
tggaggtgtt 
cccacactgt 
gtggacttgt 
tggcaggttt 
aactgaaaga 
atgccaccat 
aagatctggc 
aaggagcact 
agcagctggc 



ccgggcgggc 
gcccatcgcg 
gccgctgccg 
tcagaattta 
ggaatggaga 
tcaattcaaa 
cctccatatt 
tgacctcacc 
gatagttgaa 
tgtggacaaa 
cgaccccttg 
caggacattg 
aaccttgagg 
ggaggagctt 
cgacgaactg 
caactggctg 
acccatagat 
gcgcagttac 
ggtggcccag 
agtctctgtc 
gcagaggata 

ggggctatgg 

gtcagagaaa 
gggagctgag 
agcattatcg 
tgaaatgtat 
agagttttac 
gaaaaactac 
cgaatgtcaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2027 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 



30/50 



WO 01/21836 



PCT7US00/25643 



aagcaccttg gacaaattga aaactacctg 
cacctcactg aagaggagcg caagcacttc 
ccgtcagaca gcccaggtca taagatagtg 
gatctccatt ttgatccctc caatgccgtg 
ataaccatgt acagccagat gcctgtgcct 
cacttcagca tggagaaaaa cagctaccgg 
acgtccaatg ggatcattaa ctttccaccc 
agtttgcccg cgctggagtt gtatgaaatg 
aacacgactg ggattatctg cagaaacgtc 
tcctctctag agatgccctc aggggtggct 
agccacgtga ccctggaacc aggggccaac 
acctggaacg tatacactca ggcagctgtg 
ccctcacatc taccccattg tgcagtacga 
ggagccgctg gctgatagcc ttctggcagg 
taccggccat tatacgataa agaatggaga 
gctcatcctg tgccaggcgg agagcagggc 
ttctgaggcc gcgctccgga ttcagtcctc 
tgcgcctgcg taccacgtga tcgaatttga 
agcactcgga ggggagagtg acatgctggg 
ccaacagaga actggccgct gcatggttac 
gtggtccatc tactccacag tcatcgcact 
cage c tec tg tcctcaggaa caeggaaata 
acttgacttt cagctgtcag atagttatct 
actagtacca ctgaacacgc agtcccagca 
cgtctgggaa ctcaagtgga cagaagagee 
tggattttcc ccagcttctg aggaacagct 
atttaaagtg gaaaattttt ttacattata 
gggaatggag tattgeagaa caggctccct 
ctcagacctc ttggaggtgg ataaagatga 
gacaaagctt atgtatgaag ttgtcgacaa 
ctgcggtgtc atctccatgc cagtggctgc 
gatgccgctc ttcgccgggt atctccccct 
ccatcattct gcacactcct cccaactgga 
gtcagtagac aagcacgggg acgaccagcc 
cgtgcattcg gcctgcagca gcgagcacaa 
gccggccggc caggtcttca actccagctc 
agatgaccac gtcctggaag tcagtgtaac 
tcccagctag gagtgeaett tatgggactg 
agaccttcag tggtcctgcc tggccgtggg 
cagctgagtt ctgttgttgt tttcatgccg 
cccgtcccca tgctggtgtc cgacgccgct 
ctcacacgct gctcatgaaa gtttccaccc 
ctggctcttc gtggaaggcc atttgtcttt 
tagtgacgtg tgtgcctttc tggtcacaca 
gtgtccccta aggggtaaaa cagcegttta 
ataacaccct tctaggggag gcgggtgggg 
tgggatgccg gagctggggg acctggaggc 
aagaaagctg agacccggct gtgaggagcg 
gtttctggaa ccctgtggtc ccccataatg 
aatatattta atttaatgat cattatatat 
tttaaagtct ttacatgtaa ttgtagctgt 
tcttagaaat taagatattc categtgagg 
tggttgtagg gaaggactca agtcctggaa 
acactgttcc aatggattat ttgetttegg 
tattattttt aaaaatacag actaactgat 
tcaaggacaa aaataagtgt gtttgttttt 
gtgttgtatg tattaaacct tttccagttc 
atagagtact cttaatttac tgggcaaatg 
caagcagata gagtegggtc cagcagtggg 
aagttgtgga gagaaggagc accgggttcc 
gggcatggga ggaaggaggc acagaceggg 
tctgtgttgt cttttagctc attcaataaa 
tctttttcat ttgtagaaac tggagacgtg 
tttccagaaa tttatcttcc tcatgtgcag 
attagaactt cagaatctaa tgatagttta 
aaattgacga ttgccctgca tttctaccaa 
ttcegtcaga agagactgta atccgtgtcg 



cagaccagca gcctcttagc cagtgaccac 1800 
tgecaggaga tacttgactt tgccagccag 1860 
ctacccatgc attcctttgc acaactgega 1920 
gtccacgtgg gcggcgtttt gtgcgttgag 19 80 
gttcacgtgg agcagattgt ggtcaatgtc 2 040 
aagactgegg agtggcttac caagcacaag 2100 
gagaccgcac ctttccctgt atcccaaaac 2160 
tttgagagaa gcccatctga taactccttg 2220 
cacatgctcc tgagaaggca ggagagcagc 2280 
ctggaggagg gtgcccacgt gctgaggtgc 2340 
cagataacat tcaggactct aggecaagga 2400 
cgcctcggtg ggctccgtgt ggttcgtcct 24 60 
cgtgtactca caggagcccc agetgeaegt 2520 
cattcctcag agagtcaagt tcactgtcac 2580 
cagcctgcag cttagcaatg ccgaagccat 2640 
cgtggtctac tccaacacga gagaacagtc 27 00 
cgacaaggtc acgagcatca gtctgcctgt 27 60 
actggaagtt ctctctttac cttcagcccc 2820 
gatggcagag ccccacagga agcataagga 2880 
cacagaccac aaagtgtcga ttgactgccc 2940 
gaccttcagc gtacccttca ggaccacaca 3 0 00 
tgttcaagtt tgtgtccaga atttgtcaga 3 0 60 
tgtagatacc ggtgatagta ccgacctgca 3120 
gcccatctac agcaagcagt cggtgttctt 3180 
tcccccttct ctgcattgcc ggttctctgt 3240 
gtctatctcc ttaaagccgt atacttatga 3 3 00 
caacgtgaag gctgagatct ttcccccttc 33 60 
ctgctccctg gaggttttga tcacgaggct 3420 
agcactgact gaatctgatg agcatttttc 3480 
cagtagcaac tgggcagtgt gtgggaaaag 3 540 
tcgggccact cacagggtcc acatggaagt 3 600 
gcccgacgtc aggctgttca agtacctccc 3 660 
cgctgacagc tggatagaaa acgacagcct 3720 
ggacagcagc agectcaaga gcaggggcag 3780 
aggcctaccc atgccccggc tgcaggcact 3840 
gggcacacaa gtcctggtca tccccagcca 3 9 00 
atgacaaege cagggtgaac acacgccact 39 60 
tgactggact cttccgttct ggctccagcc 4020 
gacatcagag agtgtcatca cgcagctggc 4080 
cctgtgatct cagattcctg cttttctcac 4140 
tactcagagc cctggcctcc ctccccctac 4200 
acgctgtctc caeggaacag cctccgtctg 42 60 
caggtagaca ctcagcagcc ctcacggtct 43 20 
gctgcccagt ttcctgatcg gggtggattt 43 80 
ccgcagatcc tctcattgtg cttttctaga 4440 
gagggaggga tcataacccc ttctgtgcct 4500 
ccatcagccg gagecaegtg aaaggtactg 45 60 
cctcagcggt gaggtggttt agggataaat 4620 
ttgatagaat ateatatgea ctgggagtta 4680 
gtgggggtta atatgttgtt tttctgtccc 4740 
ataategtta tttttctttt gcatcttaag 4800 
atgagagagg tcctcagtgt gtttttggtc 4860 
tgtcctccac tggtctactg agttgcagtc 4920 
ttgtaaattt aattgtacat atggttgatt 4980 
gtaatgttta tgtataagtt gcaccaaaaa 5040 
acaggtgtga aagtcacagc ttgtaaataa 5100 
tecaaagega tgtatttttg tacacttgaa 5160 
tgcttggaat tgaacttgac aagattagct 5220 
tggccctcgt gtgaatcccc gtggatgtgc 5280 
tgcccagcac tgtgcttgcg ggaggcggtg 53 40 
gaaatatgac ageegtcatt tccagtattc 54 00 
taaaggtggt gtgatttttt tttcctcctg 5460 
taaagaagat aaataattgt gtaattaaac 5520 
tttaacaaac ttggtcaaac tagttagcaa 5580 
gggtttctaa aataaggttt tttattgtaa 5640 
gtcctgtgaa taaagagatg ggagatttga 57 00 
tcagcctggg agccttcccc agtgtaatgt 5760 
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agctttctct 
gtatctggtt 
gtgaagggga 
atttatcagt 
atgcttctgc 
tgttcatcat 
agtaaggtgt 
tgaagccaga 
tctgtggaac 
ttcatgggat 
ttgtatttaa 
gggtcagatt 
gcagagaaaa 
aaaatgaaat 
aaagggcaaa 
ctacctggtt 
ctatatacat 
tgtgactgga 
tttactatga 
ttaagacttt 
aaanaaaa 



cttaccttct 
ctactcccag 
aatttaaaat 
ttttttcccc 
ctctttaaga 
aaaagtccta 
atgtgtttga 
tatgcaatgt 
atgtaaatac 
gtcctatgat 
tctgaaaatg 
cctgtattaa 
tacacacagg 
cagacactgc 
ctgggctgat 
tttaaatttt 
caggctgtaa 
aaaggtgaca 
aaagtcattt 
aaaatgtaag 



ggaagaggga 
gatgaaatta 
gtttacggcg 
ctaattaccc 
caaagaagaa 
aaagagtaac 
gacaagctgg 
gtatctaaaa 
tgtcaaatgg 
tggaaaaatt 
actttaccta 
ttactcttag 
cttctattca 
aggtttgtat 
gtaaatgttt 
atttttcatg 
gacccccccc 
agttggtgac 
ttcatattat 
cattgtaatt 



atgtttcatt 
tccaactaca 
cttaattgcc 
aaaagatctt 
tgtcacccaa 
tgtaattgga 
ttttgttgat 
agcaaggaat 
aaaataaaat 
ataactcttc 
caacagttcc 
tggagatgcc 
aattttcttt 
ataaaatgaa 
tactttcaac 
aaaatgctcc 
agtcatcatt 
tttgacactg 
gtaatatatt 
ctgaaaatac 



tattactgtt 
tatatattta 
tggaaatgaa 
ttgcaaacta 
aattgtcatt 
tgtttattgt 
aaagagatgt 
ttgcagccgt 
aagttataat 
tgattctaat 
attgtcagca 
agatatccca 
agtgcttaaa 
aagctatact 
tgtgttcttt 
tttctctaca 
aatacaatgt 
caggtattcc 
gttagattaa 
acattttaag 



tgattttctt 5820 
gaggaa gaaa 5880 
atgaaatcaa 5940 
tgttacatga 6000 
tttttcttaa 6060 
ttttatctaa 6120 
taaataattg 6180 
tttacaaata 6240 
ttttgtgaat 63 00 
gtggaaattg 63 60 
cagcctagga 6420 
tacagaatta 6480 
attaagtttt 6540 
actttttata 6600 
aaaataaatc 6660 
tttattcatc 6720 
gttgggattc 6780 
attttcatgg 6840 
aaccattgta 6900 
aaganaaaaa 6960 
6968 



<210> 47 

<211> 1033 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 22 19 14. 2. dec 
<220> 

<221> unsure 
<222> 1027 

<223> a, t, c, g, or other 



<400> 47 

cagcactcgg ttccgtgcaa ctttcaagtg agttgcgaac tccgccctgt aggccggtgc 60 
tggtggcccg gcgcgctgga accgcggcga cccgctccag cgcgggacca gcagcaaggg 120 
ccgagcgcca ggttctccgc ggcagaaagg gcgggtggga gctgtaactg ccccggccgc 180 
ggggcgcgcc cgctcccaag tcggcttcct ccccgccggg gccgctttgc ctcgggtctc 240 
cccattctcc aggtcccctg aactgcacag tcggaggccg tgggcggcgg gctctgcctc 3 00 
cgccgaggga cagccggatc gcccctctgc ttcccgcaac tgccctgatc accccccgtc 3 60 
ccagcccttg agtgaacgtc cttctgagcg gcttcctggg gtcctcccca cgtcccaaag 420 
gccggcaaga tggtgtcctg gatgatctgt cgcctggtgg tgctggtgtt tgggatgctg 480 
tgtccagctt atgcttccta taaggctgtg aagaccaaga acattcgtga atatgtgcgg 540 
tggatgatgt actggattgt ttttgcactc ttcatggcag cagagatcgt tacagacatt 600 
tttatctcct ggttcccttt ctactatgag atcaagatgg ccttcgtgct gtggctgctc 660 
tcaccctaca ccaagggcgc cagctggctt taccgcaagt ttgtccaccc gtccctgtcc 720 
cgccatgaga aggtacccca gggggaagcg ggaagggagg cttaggggtg ggtatgaggg 780 
aaagagccct ggacctggga tttaggatgc tggggacacc ggggttctat cccagtcctg 840 
ctgctattgg aggggcagga ccttgacagg gcccagccta cctcatcttc tgggataatc 900 
aggtgggggt acacatgaat gacctttgag gatggaaatt ggtttggggg tagtggtggc 96 0 
ggcagctgga gcatgggcca cttcatggag gtgtgagcgg gtggcagccc tgctcttggg 1020 
tctgtgnttg tgg 1033 

<210> 48 

<211> 1733 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 347748. 2. dec 

<400> 48 

caaccgtgag gtgttgggtt tgggggacgc tggcagctgg gttctcccgg ttcccttggg 60 
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caggtgcagg 
gcggggaggg 
agcccgcctc 
cctgggcttc 
ccatggtggg 
tgctgctggt 
tcctgcttca 
gcgggcggct 
agatcgacca 
gcctcgggaa 
tggcagacat 
aagaagacca 
atgaaagttt 
ttaaaaagtt 
caaatgatgg 
tagctgagaa 
aagaacaaat 
caaaagttga 
aaagtcatca 
cagattcaca 
ctcttcagcg 
tactaaagca 
tctttgatga 
atggggatga 
atgaggaaga 
ttcaaatgga 
aggaatgctt 
gacttttgtt 



gtcgggttca 
acctgcggct 
ctccctcggc 
tgcctgcagg 
tttcggggcc 
ggtgatcgtc 
ggaggaggtg 
ggaaaagcgc 
gaaggaggcc 
gaga tgc gag 
acatcattta 
gcttcaggac 
tcagtgtgga 
agcagaccag 
aaaggaattg 
tgttgcagat 
caaaagaggt 
tgatcttccc 
agcaatctcc 
cataaaccac 
tttaattcca 
ggctaccaag 
aaatgaatcc 
tggtaacgta 
agatggtgat 
tcctgcagac 
cagaaaacct 
gtaaagacga 



aagcctccgg 
tgcggccccg 
cggccctggg 
tgtctgcggc 
aaccggcggg 
gtcctcgcct 
gccgagctgc 
aattcggacc 
gactacggcc 
gatgacaagg 
aaggagcaac 
tataggaaga 
cagcagatga 
tttttagagg 
gatataaaca 
aagaatgaag 
ggtgatgcag 
cctgctttaa 
catcttccaa 
aatggaaacc 
ggctcaaact 
gacagagtca 
cctgttgatc 
ggtgagtatg 
ggtggagagg 
tatggaaagc 
aaagtgctgt 
attgtatcag 



aacgcgtttt 
cccccttctc 
gccgtgtccg 
gaggccccta 
ctggccgcct 
tcaactactg 
agggccaggt 
tcttgctgtt 
gcctcagcag 
ttaaactaca 
ttgctgagct 
acaatactta 
aggaattgag 
aacaaaagca 
atcaagtagt 
aaccctcaag 
ggatgcctgg 
ggaagcctcc 
ctggacaacc 
ccggtacttc 
tggacagtga 
gtgatttcca 
cgcagcatgg 
aggcagacaa 
aagacgtcca 
aacatttcaa 
aaaatgaaat 
ttgtaaagat 



ggcctgattt 
cggctcgcag 
ccgggcaact 
gggtacagcc 
gccctctctc 
gagcatctcc 
ccagcgcacc 
ggtggacacg 
ccggctgcag 
gaacaacata 
tcgtcaggaa 
cc ttgtgaag 
agcacagcat 
agagacccaa 
acctaaaaat 
caatcatatt 
aatagaagag 
tatttcagtt 
tctctcccca 
aaaacagaat 
acccagaatt 
taaattgaag 
ctctaaactg 
gcaggctgag 
agatgatgaa 
tgatgtcctt 
cattctactt 
acattgagat 



gaggaggggg 120 
ccgaccggta 180 
ccagccgagg 240 
cgatttggcc 3 00 
gtgctggtgg 3 60 
tcccgccacg 420 
gaagtggccc 480 
cacaagaaac 540 
gccagagagg 600 
tcgtatcaga 660 
tttcttcgac 720 
aggttagaat 780 
gaagaaaata 840 
aagattcaat 9 00 
attccaaaag 9 60 
ccacatggga 1020 
aatgacctag 1080 
tctcaacatg 1140 
aatatgcctc 1200 
ccttccagtc 1260 
caaacagata 1320 
caaagccgat 1380 
gcggattata 1440 
ctggcttaca 1500 
gaacgagagc 1560 
taagtcctaa 1620 
tgtcctttct 1680 
aga 1733 



<210> 49 
<211> 574 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 401482. 2. oct 



<400> 49 

tctgctataa 

catgtacttg 

cgggcggcgc 

tfccgcgggaa 

cccagctgcg 

tcgttcgcaa 

ctcaggaaat 

agagccatgc 

ggaaggagcg 

ataaagtgcg 



atatttttgt 
ccgtatcatt 
ggcggcgtcg 
gaagaaggag 
cgtcgccaaa 
agtctatcgc 
tctacaaggg 
gccgccggtc 
gctgtatcct 
agactgactg 



acacatttgt 
aggtaagttt 
gcggcttgtg 
gage tgc tga 
gtgacaggcg 
ccgagtcctc 
caagaaatac 
accaagcacg 
ctgcgcaagt 
gcaaaaaaaa 



tgaagacata 
atatttgaat 
cagcaatggc 
aacagctgga 
gtgcggcctc 
actgttatta 
aagcccctag 
aggagaagct 
atgcagtcaa 
aaaa 



tattttcatt 
ctacgctctt 
caagatcaag 
cgacctgaag 
caagctctct 
accagactca 
acctgcgacc 
gaagaccaag 
ggectgagat 



tttttctggg 60 
tccctcggag 120 
gctcgagatc 180 
gtggagctgt 240 
aagatccgag 3 00 
aaaagaaaac 3 60 
caagaagact 420 
aagcagcagc 480 
gacaacgaca 540 
574 



<210> 50 

<211> 444 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 274551.1. oct 



<400> 50 

gtcacatcct tggcctgaaa ttctttgctg 
accactttac aaagttttcc ttgtctatca 
ttcttaaggt catcaacctt aattgggttg 
agcttgacat acataggctc ttcacatttg 
ttgtctaagg ctttggcagc tttgcaggtt 
gtcttcagtc tcttgtagag cagtattaat 
ttggcatggt ggtgggtaca ggtgaaggtg 



tagcccttca ctgtgacaca actgcaacca 60 
gttttacaaa ggcttageca tgtctctggt 120 
atttggtgtt eggcacaaag ggccacccct 180 
gatgeaagea cgcaaagatg ggcttggccc 240 
ccacctgcta ggccatcatg getgagggeg 3 00 
atccattaca cctccagcag cgtgtcttcc 3 60 
aatcttgagc cactcaactt ctgetctgag 420 
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catgttgcca tgcagagaaa gaga 444 

<210> 51 
<211> 852 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 411408 . 20 . dec 
<400> 51 

ggaaggcaat tttataactt tatttgatct gacgatcagc gattagttct catccacatt 60 
gactgtctgt agatttttga aagtggtaac aggtacatag gtaaccaaag tatatagctt 120 
atttggtgaa tcctcattac gttttctgga cagccgcaca cggattcggt atggcacatt 180 
ccttattcct ttggcccaga cagctttgtt gagcctggtg tcaatgcgca catctggagt 240 
tcccatctcc ttcatggcaa atttccgaat ctctttgagt gcccgaggtg cacgcttctt 3 00 
gaagcccact ccatgggatg cgcttgtgaa tgttgaatgg tgtattctcg ggttaaccac 3 60 
ttcgttgcat ggccagaacg gccctttttc ttctcgccac ccttctttgc gggagccatt 420 
ctgcagcgtc caagtgggta acccgagaat acaccatcaa cattcacaag cgcatccatg 480 
gagtgggctt caagaagcgt gcacctcggg cactcaaaga gattcggaaa tttgccatga 540 
aggagatggg aactccagat gtgcgcattg acaccaggct caacaaagct gtctgggcca 600 
aaggaataag gaatgtgcca taccgaatcc gtgtgcggct gtccagaaaa cgtaatgagg 660 
atgaagattc accaaataag ctatatactt tggttaccta tgtacctgtt accactttca 720 
aaagtaagtt ctccatccca taaagccatt taaattcatt agaaaaaatg tcctaacctc 780 
ttaaaatgtg aatccatctg ttaagctagg ggtgacacac gtcattgtac cctttttaaa 840 
ttgttggtgt gg 852 

<210> 52 
<211> 779 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 03 5 97 3.1. dec 
<400> 52 

tatagggtaa acatccaatt aaaaggcaga tattggctat ccttggggag gccagcaaat 60 
ttccggataa gggaagctgg tgggcaggaa aaaggcatgc aatgggcctc atgagccaga 120 
agaatagctg ctgactgatg cctgggggac caagcccata attgtggagc tggagccgca 180 
ctgtgtccct tgcttcagtc tgatctttgg ggagcctgca cagttctgac ctcagttaag 240 
agctaaggga agagatttgg gaatgtttgg ccctgtgtaa cagcaaaagg attcattggt 3 00 
gagggtgtct ggcaggtatc cgtaaagata ataagatgaa gggaacatcg ccgtttggaa 3 60 
agtgtcgtga tatgatacac aagttgtgct gcctctgtgg ctctaaggca taccaccttc 420 
agaagtcaac ctgtggcaaa tgtggctccc ctgccaagcg caagagaaag tgtaactgga 480 
ctgccacggc taaaagaaaa taccacggcg actggttgaa tgaagcacct aaacattgta 540 
tactgcagat ttaggcatgg attctttgca ggaacaacac ctacacccaa gagggcagca 600 
gttgtgtcat ccagttcatc ttaagaattt caatgattag tcacacaata aatattccgg 660 
tttttaaaaa tgtatatatt ttaaacatat atatgtttat atgtatatgt tatatctgta 720 
ttacatatat gtgaaaagag gcagagattg tcagattgga ttaaaaagct gtctgtaag 77 9 

<210> 53 

<211> 1229 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No : 45653 6.1. dec 

<400> 53 

gttctagatc gcgagcgccg ctttttttca ttggccaggg gtcccaggcc gtccaggtct 60 
tggggcgccg cggcggaaat cgcgcggatg ccagaacgcg ctctcagctt tcgggtcctg 120 
gcggctgcgg ctgccgccat catggtgcgg aagcttaagt tccacgaggc agaagctgct 180 
gaagcaggtg gacttcctga actgggaggt caccgaccac aaacctgcac gagctgcgcg 240 
tgctgcgggc gttaccggct gcagcggcgg gaggactaca cgcgctacaa ccagctgagc 3 00 
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cgtgccgtgc 
cgtgcgcgct 
cggttcgctg 
caccgtgctc 
gcaaggggca 
cgcagcatgg 
gagtacaatg 
tggctgtctt 
ctcctcaagg 
atgcatagag 
tactgattat 
gggtattgtt 
ccacgccctg 
ttcttgctcc 
ttataaactg 
ccccaactgc 



gtgagctggc 
tcggccgcgc 
gagctctgcg 
ctcaagctgc 
cgtacgcgtg 
aggactttgt 
aggagcgcga 
ttacagatgg 
gtatcagacg 
ccacgggagg 
taattgactg 
ggcttcttgt 
tttcccttta 
tggccgtggt 
agcgagacta 
tttgtaagtg 



gcggcgcctg 
tgctggacaa 
acttcgtcac 
gcatggcgca 
gggccctgac 
cacttgggtg 
tgacttcgat 
gaaaactgag 
gtcgtaggtt 
tgcgtccttg 
tgccttggga 
gttggaaact 
tcgtcttgtg 
taagctgtgt 
gtgttcagga 
caacttttt 



cgcgacctgc 
gctgtatgct 
ggcctcgtcc 
gcaccttcag 
gtggttaccg 
gactcgtcca 
ctggaagcct 
gcctgatgct 
cttaagaatt 
ttttccagga 
aacagtggga 
ttgtaatgta 
gt tggac tgg 
acagatgatg 
tctcctccct 



ccgaacgcga 
ctcggcttgg 
ttctgccgcc 
gctgccgtgg 
accccgcctt 
agatcaagcg 
agcggatctc 
ggagattcta 
tgattcatca 
aatgttctta 
agtaacttgg 
aaaggaaaaa 
ttcaattcgt 
gagagtttgg 
tgtttaaatg 



ccagttcccg 
tgcccacgcg 
gccgcctccc 
cctttgtgga 
ccttgtcacg 
gcacgtgcta 
ccactttgca 
tgagggtgct 
gtggcaggcc 
gaact tggac 
tgcagcactg 
c tggaaat cc 
ttaactcgaa 
cctcaagttt 
tcaataaatg 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1229 



<210> 54 

<211> 1342 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 387807. 4. oct 



<220> 

<221> unsure 
<222> 1195 
<223> a, t, c, 



g, or other 



<400> 54 

atataagagt 

tgactctaaa 

cctctggttt 

gaagactgag 

caatccttct 

aaccagatgg 

ctgatagttc 

gaaaccgccc 

ctgacacagc 

gcttttgggg 

ctgttgggta 

gattagggat 

tttcacgtct 

attgataatt 

cacccaggat 

ggacaatgag 

tggggagata 

gataggcaga 

acactgtggg 

atccacaggg 

ggggagaggt 

catgcccctc 

tacctctccc 



ttgagaggta 
ccacatggca 
tggtccctgg 
tttcaaagtt 
cagtcactaa 
tgtataagag 
tcatgagtat 
tggtgttcca 
cacttctggt 
acagggggcc 
ctggtgaagg 
tgattagcat 
gtatgataga 
gtcgatttta 
atggtctttg 
aacagccact 
agaaccgtga 
aaagaaaatg 
ggacagaaca 
agggccctac 
gcagtgaccc 
tgccagctga 
ccactcctcc 



ctggtgcact 
tacagttcct 
cagctgctga 
ccagtattgc 
atgagtgtga 
tattttgtca 
tactgctctt 
gcagcagcga 
ggttgccact 
caaggagc ag 
attctggatt 
atgatttagg 
ggtggtattg 
tgttaatccc 
ctgtttgatt 
tcccatcagg 
gagaagttgg 
ttgccagttg 
tctgcatgtg 
atcatctgct 
tgcaggccag 
gacagcccac 
tc 



tcttcacact 
gcctacttaa 
ttcttggcaa 
ctttttcttt 
caacaccttg 
aaactataaa 
ctttcccaca 
agaggaacac 
catctgaggc 
tagacggtaa 
tgaatcctgc 
gcaagttgct 
tttgacttgt 
tagtacatgg 
ttcctcatcc 
agtcactgca 
cacaaaggag 
atggggaaga 
cactctcatc 
ac cc tgaagg 
ccctccaacc 
acacacccca 



aacagacgtg 
tgtttacttt 
aacctcagag 
tttttttcta 
tacggttgtt 
ggaggatgtg 
gttaaaagaa 
gaaagcaaat 
tgtgagtctt 
tcgttaagat 
ctctccgtct 
tgagctcttt 
atttgtgaag 
cctgctgtca 
ccagtctcaa 
aggcccccag 
ttatgggaca 
aaggaagtca 
tcttgtagtc 
atctggaggt 
tcctcccaca 
gccttaatga 



tgaggatgta 
tctacctctg 
cttggagtca 
gccatgatat 
ggtggcatta 
gctgtagggg 
tattggcaga 
ggcagtagcc 
gcctggacag 
tgtggatgga 
gctaagaatt 
gggcctctgt 
tttcaatgag 
acacccagga 
ggggaagcca 

ggtgggatgg 

aagggtccaa 
gagggctcag 
agcaacaggt 
aagangctct 
gtggacaccg 
ttgttctctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1342 



<210> 55 

<211> 859 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 406790. 3, dec 

<400> 55 

ccacaggggg gcggggaagg aagatggcgg cgcccagcgt cccgtgagga gagaggacac 60 
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agggatcccg 
cctcacgcct 
tacaggactc 
catttcagca 
accagcaagt 
caggtcaagc 
tgtccagaaa 
actcaataga 
aagtgtatga 
tgtttttctg 
ttatttacca 
tcttcaattt 
aatagtattg 
ttgaatgttt 



gggagcggcc 
cctgacaggt 
ccctcacaag 
ctttagcttc 
aagacccaag 
caaagcagaa 
gccaaatgaa 
tgtagggaaa 
tttgccagga 
atatacttac 
cctgaaaaga 
aactgggatt 
cattgtaagt 
atcctgctt 



ccagactcgt 
tgcgtaggag 
ttcttacatc 
ctcaaacgca 
ccagaaccag 
atatatgaga 
aaggaattga 
gagacaaaag 
attttggctc 
ttatttgaaa 
ataatttgga 
tgaaattaaa 
attgtattaa 



aaattatggc 
gctctgtctg 
ttctcaggaa 
tgtatgtcac 
tagcatctcc 
tgagacctct 
tagaactaga 
aggaaaagcg 
gactatccaa 
ctctatcttt 
actgcaggtc 
ttttctgaat 
aaatagtcac 



cgcatctccg 
gtatcttgaa 
tgtcaataag 
acagctgaac 
tttccttgaa 
ctcaccgccc 
gccagactca 
gtggaaagag 
aatcaaactc 
agtgaaacaa 
ctgtcttagt 
tctatttaca 
cctttccttt 



cacactctct 120 
agaagaacta 180 
cagtggatta 240 
agaagccaca 3 00 
aaaacatctt 3 60 
agcctatctt 420 
gtaattgaag 480 
atgaagctgc 540 
acaggtactt 600 
aaagctaatt 660 
atttttcctc 720 
cagaattgta 780 
gcaattaaag 84 0 
859 



<210> 56 
<211> 279 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID'No; 



41242 0.63 .dec 



<400> 56 

aggtcctatg 

aggacagcag 

cacccatcgt 

cagcagaggc 

acgtccatca 



cacaacagcc 
aagctggaag 
tcttatctcg 
caggcctcaa 
gcaccccttc 



agagttatct ctgtgtatgt caccttgctc tccacaaaga 60 
aaagtcctct gggcacctgg gctctggcgt tgctctcttc 120 
cactcagggt agaaaagttt ccagtacaga aattcaccca 18 0 
ttagaggcct acacaaatgg ggtagacgct ctcacagtcc 24 0 
catgttcttt aaggataag 279 



<210> 57 
<211> 1038 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 196623. 3. dec 



<400> 57 

cccccgcagc 

gcggcgttcg 

gaatagaaca 

actcttgaaa 

ggccactgaa 

caaatgggac 

tgtggatttg 

aacagacagg 

aaagatcatg 

aattatgcgt 

aaatggtgac 

tggagtagag 

tatagatttt 

caccctcctt 

ttttggaaag 

ccctgatagc 

cccaaatgcc 

cgctgttaat 



cctagagccg 
tgtccgagtt 
gcaatgagag 
aaggatccag 
ggaccttgta 
gcatggaatg 
gtgtccagtt 
aaatcaactg 
ttcaaccggc 
gcacttaaag 
tattacagta 
gagaaagcta 
cctaagcctc 
gggctattcg 
aagttaacag 
acttttcaga 
ttgagaattt 
gctgaaga 



cccaagggat 
ctctgcaggt 
ccagtcagaa 
gaaacgaagt 
acatgcccaa 
cccttggcag 
tgagtccttc 
ggtttgaaac 
ccaaaaagaa 
ctgccagcaa 
gtgggaatga 
aaaataatgc 
tgattgcagt 
atgccgtgta 
cgggagaggc 
aagaagtctg 
caaaagaggt 



ggcgatggcg 
cactagtttc 
ggactttgaa 
gaagctaaaa 
accaggtgta 
cctgcccaag 
attggaatcc 
tctggtggtg 
aaatgccata 
ggatgactca 
tctgactaac 
cgttttactg 
ggtcaatggt 
tgcatctgac 
atgtgctcaa 
gaccaggctg 
aatcaggaaa 



tacttggctt 
ccggtagttc 
aattcaatga 
ctctacgcgc 
tttgacttga 
gaagctgcca 
tctagtcagg 
acctccgaag 
aacactgaga 
atcatcactg 
ttcactgata 
agggaatttg 
ccagctgtgg 
agggcaacag 
ggacttgtta 
aaggcatttg 
agagagagag 



ggagactggc 60 
agctgcacat 120 
atcaagtgaa 180 
tatataagca 240 
t c aac aaggc 3 00 
ggcagaacta 3 60 
tggagcctgg 420 
atggcatcac 480 
tgtatcatga 540 
ttttaacagg 600 
ttccccctgg 660 
tgggctgttt 720 
gcatctccgt 780 
agatgcttat 840 
ctgaagtttt 900 
caaagcttcc 960 
aaaaactaca 1020 
1038 



<210> 58 
<211> 457 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 
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<223> Incyte ID No : 427 9 16. 8. dec 
<220> 

<221> unsure 
<222> 398 

<223> a, t, c, g, or other 



<400> 58 

gacagcccca 

actgccagtc 

gcccagaggt 

cgcggacaca 

acgggggac t 

acgggaagga 

agcagagcag 

caccgggtct 



gcgaggccat 
ccctgacacg 
gtctgtagtg 
ctcgcaggtc 
ggaggaggag 
agtgggcagc 
tgcccggatc 
acagcagctg 



ttccagcaca 
ctgtgcccca 
agcttctgcc 
gctgtggccc 
ccagagctca 
atcatcggga 
accatctccg 
tcttccatgc 



tagaagagag 
cccactgcag 
ctagtgactt 
cagcctcgcc 
gcatcaccct 
agaagggcga 
agggctcntg 
agtctcc 



attggaaacc 
cccagtgctg 
ttgagccggc 
tgacagaatg 
cacgctgcgg 
gactgtaaag 
cctgaacgca 



aacgtgcaga 
aatgaaccct 
caggttgcag 
agcggctcgg 
atgctgatgc 
cgaatccggg 
tcaccaccat 



60 

120 

180 

240 

300 

360 

420 

457 



<210> 59 

<211> 7680 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 264633. 8. dec 

<220> 

<221> unsure 
<222> 6249 

<223> a, t, c, g, or other 



<400> 59 

ggaattgcag 

ctgggcataa 

gcgggttgca 

ctgtggacat 

gcagcagcat 

gatggaattt 

gatgaagcag 

tgaaccgcga 

gggtcatgca 

aagatcttac 

ttgagtccca 

gctctccaga 

aatcttccat 

aaccagaaaa 

agcagggcct 

ttccagctaa 

acaacgttgg 

tttctgcaga 

gccagtatca 

gcctcgtagc 

aggcacccac 

cccagtggga 

ggaaagccaa 

ccaaggaggc 

gtgaagaagc 

ggcggagggc 

agagtactcc 

ggaggaagaa 

ttttggtctt 

gtgaggagat 

cacgctacaa 

atgtaaatgg 

aagctgagga 

acacaatcac 

cgctcaagag 



gtgtgagcca 
aattctggac 
gacaccagca 
ctcgggccgc 
ccatgaccgc 
agcatcaagc 
gcaccagaaa 
gtgttctgtg 
gaagtttaac 
ttcccgggtg 
ggaaatgaaa 
aattaagctg 
ttgtggtgac 
caaggcgaga 
tgtcgaagca 
gaaagagtct 
tgatttggtg 
tccactcctt 
cgtacagttc 
ttttgaagga 
gaaagctgag 
aatgggcatt 
gttcaccttt 
tggcattgct 
tgctgaaaac 
caaactgtgt 
tcaaaagacg 
gaccacagtc 
ctgtcaaaaa 
tgaagagctg 
caccaagttt 
gaaaaaaaga 
cacacccagg 
tgacaaaacg 
ccaggcagca 



ctgcacccgg 
acttgtggtg 
gccccctatt 
atagccctgt 
aacccatcag 
agagtcccct 
tccttcggca 
ttcctcagca 
ggccacgacg 
tttaatggag 
gggattggga 
aaaat caeca 
agtgctgctg 
aggaacagga 
gctcttgtgt 
tgtccaaaca 
tggtccaaag 
cacagctata 
tttggtgacg 
gaaggacagt 
aaaat taagc 
gttcaagcag 
ctctatgtgg 
gcagagtctt 
cccaagtctg 
agetctgeag 
gcagaggctg 
tccatgccac 
cacagggatg 
ctcaggtcac 
gccctggtgg 
aaccacacaa 
aaaagactca 
gecagaacaa 
acgaaaaatc 



cccttgcctc 
gcacgagctg 
gctgcacgtc 
tgegegggea 
agtgttctaa 
ttctgttcag 
gtgecaaegg 
aagcccagct 
ccctgccctt 
aacccggcgc 
caccccctaa 
aaacatacat 
atgtgtctca 
agaggagcat 
ctaagatctc 
ctggaagaga 
tgtcgggtta 
ccaaacttaa 
ccccagaaag 
ttgaaaaatt 
tattgaaacc 
aagaagctgc 
gggaccagct 
tgggagaaat 
tgagagaaga 
agaccctgga 
accccagaag 
gaagcaggaa 
aggtggtagc 
agtggagtct 
cccctgtcca 
agaggataca 
ggacggacaa 
gctcttacaa 
tgtctgatgc 



atgtttgaat 
tetgetttte 
aggtgtcctc 
tcctgtgcgt 
gaaeggaage 
agtgttgtaa 
gaagactccg 
ctccagtagc 
tattccagcc 
acacgatgcc 
cactacccct 
gaatgggaag 
gtcagaagaa 
aaaatatgac 
aagtccttca 
caaagaccac 
cccttggtgg 
aggtcagaaa 
agcttggata 
atgccaggaa 
aatttcaggg 
aagcatgtca 
tcatctcaac 
ggcagaatcc 
gtgcattccc 
gagtcacccc 
aggagtaggg 
gggagatgea 
tgagcaccca 
gctgagtgag 
ggctgaagaa 
ggaccctaca 
gcacagtctt 
ggccatggag 
atgtaaacca 



gataatttac 
tgtggagact 
cacctcagta 
tgcaggagat 
atctgggctg 
agtgcataaa 
agetgegagg 
ctgeaggagg 
gacaagctga 
aaactgcgtt 
atcaaaaatg 
cctctctttg 
aatggacaaa 
tecttgetgg 
gataaaaaga 
ctgttgaaat 
ecttgeatgg 
aagagtgcac 
tttgagaaga 
agtgccaagc 
aaattgaggg 
gtggaggagc 
cctcaagtag 
tcaggagtca 
atgaagagaa 
gacataggga 
tctcctcctg 
gcatcccagt 
gatgettcag 
aagcagagag 
gactctggta 
gaagatgctg 
eggaagagag 
gcagcctcct 
ctgaagaagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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gaaatcgggc ttccacggca gcatcttcag 
ctgcatcctt aactgagaat gaggtctcgg 
catacgaaag tgcagacgaa acacaaactg 
gaggagtgac tgccaaaaag gagtatgtgt 
tgctctgtga aggaccctgc tgcggagttt 
gccagaaggg aggttcacct gcagcgagtg 
taaagagagc aagacagatg ttaagcgctg 
tgaggcttgt gtgaaaaaat accctctgac 
cctccacagc tgtgtgagct gccatgcttc 
taaaatgatg cggtgtgtcc gctgccccgt 
agcaggatgc tcagtgatcg cctccaacag 
gaaggggaag cgacaccacg cccacgtcaa 
ggggagcctt ctgtgctgtg agtcctgccc 
cgagatgcct gacggcagct ggttctgcaa 
ccaggatatc atttgggtga aacttgggaa 
tcccaaaaat gttcccccaa atattcagaa 
gtttttcttt gggtctaaag attattactg 
ggsigggggac cggggcagcc gctaccaggg 
cgcactgcaa gaagctgaag ctcgttttcg 
aacacaggag agcgagcgca agcccccacc 
cgggaaagtc cagatctaca cagcggatat 
cacagatgag aatccttgtg gctttgattc 
gtgccacccg caggtgtgtc ccgcgggcga 
ccagtaccca gagaccaaga tcatcaagac 
gagggacatc agaaagggag aatttgttaa 
ggagtgcatg gcgagaatca agcacgcaca 
cactatagac aaggaccgta taatagacgc 
gaatcacagc tgccagccca actgtgagac 
tgtgggcctg tttgccgtct gtgacattcc 
cctcgattgt ctgggcaatg aaaaaacggt 
attcctcggg gatagaccaa agacctcgac 
gaccaagaag aaaacgaggc ggcgcagagc 
cgagtgcttc cgctgcggtg atggcgggca 
caaggcctac cacctgtcct gcctgggcct 
tccttggcat cattgtgacg tgtgtggcaa 
caattcgttc tgtaaggagc accaggacgg 
gtcctactgc tgtgagcatg acttaggggc 
gcccccccca gagccaggga agccgaaggg 
agtcacagag ggcaaatagc gccaggcggc 
cggccggccc tgcctgcggg agagggcgag 
ccgccaggac acagacgtac aggcctcctc 
tcctcagcag cgtccgctgc gtctgcactg 
ggacaaacag cctcactcct cagcgttacc 
ttccctccca ctctattttt ttaggttaaa 
ctctgaaatg tgtagcgtag gcttttccca 
cccctttctg gctctcagcg ccgtcgccac 
gactgaccct ggattcctcg aaactgccat 
ctgtatcatt tttttgtact aatgtgaatt 
gtgagaagct ggccctgcag gtggtggcag 
tttttctcct catggatttg aatgaaatgc 
tacgcggagc actttcgagg cctggccggg 
ctgaactcgc taggttctta tcaacatttg 
gcttttcttt accagtttct gatttttatt 
aaatcaccac cgactgaagt gtgtgtttac 
agtgactttg cagggcgtga gaccgcagtc 
gagccccgta ggcgctgcag gccccgggga 
gtctagctct cctgccaccc agagtggctt 
aagatgcagt ctctgctgat cacatgtgcc 
tcctgggggt ggaggcctgc ctggcaggtg 
ggtctaggcc aggtatgaca cccactctcc 
ccagttctac agagtgagac ctatctatct 
tttttttgag ggatccttga ccctgggaag 
tgtgcctttg cccacgtgtc ctgaggggct 
agcagcaggt gcttttttat ggccttttct 
tgcagagaag tttacagaac tccccttgaa 
tgtggcatct tttatgcctt ggtaaaaact 
ccgtggtttt gtgacctgta agcgtggggt 



ctcttgggtt tagcaaaagt tcatctcctt 2160 
acagcccggg agacgagccc tcggagtccc 2220 
aagtatctgt ctcatccaaa aagtctgagc 2280 
gccagctgtg tgagaagccg ggcagcctcc 2340 
ccacctcgcc tgccttgggc tttcccggag 2400 
tgcctcaggg attcactcat gtttcgtgtg 2460 
tgtggtaact cagtgtggaa aattttacca 252 0 
tgtatttgag agccgaggtt tccgctgccc 2580 
caacccttca aacccaaggc cgtcaaaagg 2 64 0 
tgcctatcac agcggggatg cttgtctggc 270 0 
catcatctgc actgcccact tcactgctcg 27 60 
cgtgagctgg tgcttcgtgt gctccaaagg 2820 
agcggccttc caccctgact gcctgaacat 2880 
tgactgcagg gctgggaaga agctgcactt 2940 
ctacagatgg tggccggcag aagtttgcca 3 000 
aatgaagcac gagattggag aattccctgt 3 060 
gacgcatcag gcgcgagtgt tcccgtacat 3120 
ggtcagaggg atcggaagag tcttcaaaaa 3180 
tgaaattaag cttcagaggg aagcccgaga 3240 
atacaagcac atcaaggtga ataagcctta 33 00 
ttcagaaatc cctaagtgca actgcaagcc 33 60 
ggagtgtctg aacaggatgc tgatgtttga 3420 
gttctgccag aaccagtgct tcaccaagcg 3480 
agatggcaaa gggtggggcc tggtcgccaa 3540 
cgagtacgtt ggggagctga tcgacgagga 3 600 
cgagaacgac atcacccact tctacatgct 3 660 
tggccccaaa ggaaactact ctcgatttat 3720 
cctcaagtgg acagtgaatg gggacactcg 3780 
tgcagggacg gagctgactt ttaactacaa 3 840 
ctgccggtgt ggagcctcca attgcagtgg 3 900 
gaccctttca tcagaggaaa agggcaaaaa 3 960 
aaaaggggaa gggaagaggc agtcagagga 4020 
gctggtgctg tgtgaccgca agttctgcac 4080. 
tggcaagcgg cccttcggga agtgggaatg 4140 
accttcgact tcattttgcc acctctgccc 4200 
gacagccttc agctgcaccc cggacgggcg 4260 
ggcatcggtc agaagcacca agactgagaa 4320 
gaagaggcgg cgacggaggg gctggcggag 4380 
cgcttggccg gatccagggg cggtgcaggg 444 0>' 
catgaactgg cccggaggac ccagctcgag 4500 
gggagggagc gcctccccac c act gage ca 4560 
atgacegtet gagcccagct cagcgttcct 4620- 
gccacacttg aatttctccg aatgtcaagg 4680 
gttaattggc atatggaatg ttttaatctc 4740 
agggtegcta gaaactcgtc ttcgcgttgc 4800 
tegggagagg ctgggtgagg cccgtgtgag 4860 
tgtgatcatt actctgctct ttggaaatgg 4920 
gttcctcaga aacgettett ttccatccta 4980 
caatggtgtt gtaagatttc ctcccgtagt 5040 
caataacacg tccactttca acgtgtagtt 5100 
ttgggcctac ttctcacctg ggectatett 5160 
ggggataact ttgtatattt ttttcatttg 5220 
ctcaatatat ttttgctaaa cctatttcac 5280 
tgatgeggee ctgagctcca tggegaaagg 5340 
tgettagage acaggaagtg acaacttagg 5400 
ccccagcacg tgggtctaaa gagagaegga 5460 
ccatctcagc atctgtgggt ctggtgatgg 552 0 
ctctgccagg gcacctactg agaggtgegg 5580 
tgcgtgcctc gtacgtgtgt tatgggcact 5640 
tgtgagattt cactttagtt tttaaaaggt 57 00 
gagtactaca tatgttttaa gacttggttc 5760 
tctggagcac cctgagaagg gggcaccatg 5820 
gcttgtctgg gagggaggga gagaacattc 5880 
taaaataacc taagggggac acatccatct 5940 
aactgetget gaggctcctg ttaaattttc 6000 
gcagtgtctt tggacctgag agtggctact 6060 
tcaggggtgt gtggccctgc agggtcccac 612 0 
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gcctccctga gcactgactg gaagtttcac tggctggtgg ctgtcccttc tcccatcagg 6180 
gtccccagca aagttaacta cacagaggac ccaggggaaa cgagctgtgt agccactgac 6240 
ttgctcgcnc ggccgtggcc tctgaggggc actcgccggt taagacaggg tgggagtagt 63 00 
gctttccagt tcagactcta acttctccca aagtgtccta agaaaatact ggatcggctc 63 60 
atagatttat gctccttatg atgccctaac ttggaaggtt gttctaggga caggccgggc 6420 
agtgtcccca cacacacctt agagtcgaag gccccagggc cccgctgtca cttgcccaaa 6480 
agatcccttc cggcaggtaa gggactacca atgcttacgt caaaacagca gaatcggctt 6540 
tgcagtgcac tttggggagc agatattaac ttatttttgt gttggacagt agtgaaatct 6600 
tgtgattttt aatcgctttg ataatacttc caaattttat gatttttctg aaggaaataa 6660 
tgcaaacatt ttaaatatgt ttctccccct ttccaaaaac tgttaaacta atgagcaagt 672 0 
aacactaact ttgaatgtct ctacaatacc cgttgataac tcagtggagc caggctttgg 6780 
ggtagcggcc ctgagcttgc agggtttctc gccactgggg ctgaccacgc ccccagctgt 6840 
gaccgtgggt gtggctggct ctcggccctg cccagctttg ttctgaggac gtggtgactt 6900 
cctgaacatc agcttcaatc ctccatcatt aatgtgaagc aaaacacaaa aaccgcccca 6960 
atccctcagg attccttggc atccgaaacc agcatctgca cctaaaccca tacccacccg 7020 
tgtgcgccca cagggggatg tgtccgaatg ggcagcttaa aatgtggtca cctgtggggg 7080 
aaactcttca ggcacctgaa gtgagaaccc agctgtccgt cctcaggccg gcctttcttc 7140 
cggcgacacc cgtccatggc tggctgggtc cccttcgcag tgtttgtctg tcttgacatc 7200 
taaaccccgg cgtgtgcagt gcccatcttc caggactacc ttattttcca gaattaaacc 7260 
tgttttataa ttcaagttaa tgcaaatgac tgtcagttgc caaatatctt gatcctatga 7320 
gtgtagttga tgactgtttg ttagtcagta gagtaaaatg ctgtgtccac ggggtgtcac 7380 
agcctcacca taccctgttg aggtgtgaaa tgccccgtca gaaattaaat acaaacttaa 7440 
atgtgcctat tggtgtctaa acttcataca atgtaaggtc agattccttt taggaatact 7500 
gggtgctgtc accaggtttg atagttagac ttaaaaactt gaaattcact ttttgggggg 7560 
agggatatac tgaaatagag agttgagact tgccagttgg gggaaaatag catttaaaat 7620 
ggaaagctgt gtttggaaaa ttgtgtatga gtatttttgt attaaaaaca ttttaaaggc 7680 

<210> 60 

<211> 1576 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 337822. 4. dec 
<400> 60 

gggagaacat tctaaacaac tgatggaggt tcttgtggcc aggcaagttt cagtagagct 60 
gaggtagatg gagttgcttg ggaataagga ccatagagac taaaagaaag gtctaaggag 120 
catattttgc agaggcccaa gtccataata aactggaata acccttcttt gccccaggtt 180 
atttccattt ccagatccgg cactctagat actcctttta ttttctgggc ctctaggatt 240 
tatctcctgc tttctttttc aatgctgtca cctattattt ctgactaagg gttttctctt 3 00 
tataaatcca ctctgagatg gaattctttt ttccctaaat cccttttgaa atatttctaa 3 60 
cagatgacac taaatatggg aaagcatatt gggaccaaaa atataccatt ctttgttctt 420 
ctttcttcct ttaaaaagtt agttgccatg tttaaaaaga aaagtggaca tgtgctcttc 480 
tcctttctgg tcctatccct tcctccccat catatcctac tgtaaactgg ctctggggca 540 
gataaaaatc aaggaaagca aggggaagtg aatttttgta gtattgggtg ggctttcaag 600 
tataagggcc ttatatatga aagttaagtt agggtctact atgtatatgt gttcagtttt 660 
taaaatttta gttaattttt ttaaaaattt aggtttatgt ctgggaaaga aataaagaag 720 
aagaagcatt tgtttgggtt gcgaattcgt gttcctcctg tgccaccaaa tgtggctttc 780 
aaagcagaga aagaacctga aggaacatct catgaattta aaattaaagg cagaaaggca 840 
tccaaaccta tatctgattc aagggaagta agcaatggca tagaaaaaaa aggaaagaaa 9 00 
aaatctgtag gtcgtccacc tggcccatat acaagaaaaa tgattcaaaa aactgctgag 960 
ccacttttgg ataaggaatc aatttcagag aatcctactt tggatttacc ttgttctata 1020 
gggagaactg agggaactgc acattcatcc aatacctcag atgtggattt cacgggtgct 1080 
tccagtgcaa aagaaactac ctcgtctagc atttccaggc attatggatt atctgactcc 1140 
agaaaaagaa cgcgtacagg aagatcttgg cctgctgcaa taccacattt gcggagaaga 1200 
agaggtcgtc ttccaagaag agcactccag actcagaact cagaaattgt aaaagatgat 1260 
gaaggcaaag aagattatca gtttgatgaa ctcaacacag agattctgaa taacttagca 1320 
gatcaggagt tacaactcaa tcatctaaag aactccatta ccagttattt tggtgctgca 1380 
ggtagaatag catgtggcga aaaataccga gttttggcac gtcgggtgac acttgatgga 1440 
aaggtgcagt atcttgtgga atgggaagga gcaactgcat cctgactgta ggactgaaca 1500 
ttatgttcac tgcactctga ttttctgtag gtacagttca aagccctaaa ggagtctggc 1560 
ttttactatc tttctt 1576 



<210> 61 
<211> 4744 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 902943.1. dec 

<220> 

<221> unsure 
<222> 49 

<223> a, t, c, g, or other 



<400> 61 

cttcaaagca 

aggccccaaa 

gaaagggccc 

tgactttgat 

caaagtggaa 

taatgcccca 

aatgcccaag 

gcccaaggct 

tattgaagga 

caaagccccc 

gggtgatgtg 

atcaagggcc 

cacctaaaga 

ggtccagatg 

gatgttgaat 

tttaagatgc 

aatctcacag 

gatctgaaag 

gtgggcgttc 

agcatgcctg 

cttgatgtct 

gagggaaag t 

atctccatgc 

gtgtctctgc 

aaagtggata 

cctaaaataa 

gacgtgaacc 

ccagatgtga 

gagatgaaca 

ccctaaaatg 

acctgctctt 

tggcccagat 

gcccaagatc 

tgttgatgtc 

gggcccagaa 

tcgttttaag 

tgtcaaaggt 

tgatgtcgag 

ttcatctccc 

aggagactta 

tattaaggca 

gaatctgaaa 

agggccagat 

gaacattgag 

attcaagatg 

aaacttgaaa 

ggatctgaaa 

tgtggatgtt 

cagcatgcct 

cattgacatt 

agatgcaaag 

gatctccatg 

tggatgtttc 

gcccaaaagt 



gagggccctg 
gtggacgttg 
aagttcaaga 
ttgcatctga 
ggtgacctca 
gatgtgggtg 
ttcagcatgc 
gaccttgatg 
ccagagggaa 
aagatctcca 
gatgtttccc 
caaaagtgga 
tgcccaagat 
tggacgtgaa 
gtcccgatgt 
cagagatgca 
gtccaaaaat 
gtcctgaagt 
aaggcccaga 
gcttcaaagg 
caggacccaa 
tgaaaggtcc 
ctgacattga 
caaaagtgga 
ttaatgcccc 
aaatgcccaa 
tgcccaaggc 
atat tgaagg 
ttaaaggctc 
aaaggagagg 
gacataaaag 
gccaaattaa 
tccatgccag 
tctgggccca 
gtggacgttt 
cttcccaaat 
aaaaagccag 
gtccaaggaa 
aaagtttcta 
gatattgcag 
ccagaagtca 
atgcccaaga 
gtagctgtgg 
ggccc agate 
cctgacatga 
ggccccaagg 
gggecagagg 
catggtccag 
ggcttcaaag 
tctggtccca 
ctgaagggee 
ctggaaatta 
ccttcctaaa 
ggacattgac 



aagtggatgt 
aaggccctga 
tgecagagat 
aaggtcccaa 
agggccccga 
ttcaaggccc 
ctggcttcaa 
tctcaggacc 
agttgaaagg 
tgectgatat 
ttgctaaagt 
cattgacgea 
aaaaatgc cc 
cctgcccaag 
gaatatcgaa 
ttttaagact 
aaaaggagat 
tgacatcagg 
ctggcacctg 
agagggecca 
ggtggacatt 
caaattcaaa 
tcttaacctg 
aggtgacatg 
agatgtggat 
gatcagcatg 
tgaccttgac 
tecagatgeg 
ctaaaatatc 
tggatgtttc 
geccaaagat 
aaggtccaaa 
aaattgactt 
agttggaagg 
ccggtcctaa 
ttaatttttc 
atattgacat 
aagtgaaagg 
tgcctgacgt 
gtcccaattt 
atettaatge 
tgaaaatgee 
atctaccaaa 
tcaatgtgga 
atatcaaagc 
tgaaaggtga 
ttgatatcaa 
actggcatct 
gagaaggc cc 
atgtagacgt 
ccaagttcaa 
ttgaacctga 
gtggaaggtg 
gcacctgatg 



gaacctgccc 
tgttaacatt 
gaatatcaaa 
ggtgaagggc 
agttgacatc 
agactggcac 
aggagagggt 
caaagtggac 
gectaagtte 
tgaccttaat 
ggaagtgacc 
cctgatgttg 
aagatcagca 
gctgacattg 
ggacctgaag 
ccaaagatat 
gtggatgtta 
ggtcccaaag 
aagatgecca 
gatgtggatg 
gatgttccag 
atgectgaga 
aaaggtccca 
aaagttcctg 
gttcaaggee 
cctggcttca 
gtctcaggac 
aaac tgaagg 
aatgectgat 
acttgeaaat 
agatgtagat 
actgaagatg 
gaatttgaaa 
tgacattaaa 
gcttaatatc 
gggctctaaa 
aacaggtcca 
atccaagttt 
ggagctaaat 
agaaggtgac 
acctgatgtg 
caaattcagt 
aggagacatc 
aggtceggag 
tcccaagatc 
tgtggatatt 
aggecctaaa 
gaagatgece 
tgaagtcgat 
tgatgttcca 
gatgectgag 
acttgaaggg 
acctcaaggg 
tctgatgtct 



aaggctgang 
gaaggaccag 
gcccccaaga 
gatgtggatg 
aaggggecca 
ctgaagatgc 
ccagatgtgg 
gttgatgtcc 
aagatgccag 
c tgaaaggac 
tcaagggccc 
atgttcatgg 
tgcctggctt 
atgtctcagg 
gaaagtggaa 
ccatgccaga 
caggccctaa 
tggacattga 
aagtgaaaat 
tgaacctgcc 
atgtgaatat 
tgaacatcaa 
aagtgaaggg 
acgtggatat 
cagactggca 
aaggagaagg 
ccaaggtgga 
gecctaaatt 
ttggacctca 
gtagaaggtg 
gctccagata 
cctgacatgc 
ggctcaaagc 
gctcccagtt 
gaaggcaagt 
gttcagacac 
aaagttgata 
aaaatgeett 
ttgaaaagtc 
tttaaaggee 
gatgttcatg 
gtgtctggct 
aacatagagg 
ggaggcttga 
tccatgcctg 
tctcttccca 
gtggacatca 
aaagtgaaaa 
gttaccctcc 
gaegtgaata 
atgaacatca 
acccaaaatg 
cccagaagtt 
cagtggccca 



ttgatgtctc 60 
agggaaag t t 120 
tctccatgcc 180 
tttctctgcc 240 
aagtggatat 3 00 
ccaaggtgaa 3 60 
atgtgaagct 420 
tgatgttaac 480 
agatgaatat 540 
ccaaagtgaa 600 
agaagttgac 660 
cccagactgg 720 
caaaggagaa 780 
accgaaagtg 840 
aagtccaaag 9 00 
tattgacctg 9 60 
ggtagaggga 1020 
tgtcceggat 1080 
gcccaaattc 1140 
caaggctgac 1200 
cgaaggccca 1260 
agcccccaag 13 20 
tgacatggat 13 80 
taaaggcccc 1440 
cctgaagatg 1500. 
tccagaagtg 1560. 
cgttgatgtt 1620 
caagatgeca 1680 
atcttaaagg 1740 
atttgaaagg 1800 
ttgacattca 1860 
atgtaaacat 1920 
1 1 aagggaga 1980 
tggatataaa 2040 
caaagaaatc 2100 
ctgaagtgga 2160 
ttaatgetec 2220 
tcctgagtat 2280 
ccaaagtcaa 2340 
ccaaagtgga 2400 
gtccagactg 2460 
taaaagcaga 2520 
geccaagtat 2580 
aaggtcccaa 2640 
acattgactt 2700 
aac t tgaagg 2760 
atgccccaga 2820 
tgcccaagtt 2880 
ctaaagctga 2940 
t tgaagg tec 3000 
aagcccccaa 3 060 
aagggtgatg 3120 
gacatcaagg 3180 
gac tggcacc 3240 
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taaagatgcc caagatgaaa atgcccaagt tcagcatgcc tggcttcaaa gcagagggcc 33 00 
ctgaagctgg aatgtgaacc tgcccaaggc tgacgttgat gtctcagggc ccaaagtgga 33 60 
tattgatgcc ccagatgtgg gtgttcaagg cccagactgg cacctgaaga tgcccaaggt 3420 
gaaaatgccc aagttcagca tgcctggctt caaaggagag ggcccagatg gggatgtgaa 3480 
gctgcccaag gctgacattg atgtctcagg acccaaagtg gacattgaag gccctgatgt 3 540 
taacattgaa ggaccagagg gaaagttgaa gggccccaag ttcaagatgc ctgagatgca 3 600 
cttcaagacc cccaagaact ccatgcctga tgttgatttc aatttaaagg gacccaaaat 3660 
caaaggagat gttgatgttt ctgccccaaa gctggaggga gagttaaaag gtccagaatt 3720 
ggatgtcaaa ggtcccaaat tagatgctga catgccagaa gtagctgtgg aaggcccaaa 3780 
tggcaagtgg aaaactccta agttcaagat gccagatatg cactttaaag ctcccaaaat 3840 
ctctatgcca gacctcgatc tacacttgaa gagccccaag gcaaaaggag aggtggatgt 3 9 00 
agatgttccc aaattggaag gggaccttaa agggccacat gtggatgtca gtgggccaga 3 9 60 
cattgacatt gagggaccag agggcaaatt gaaaggccct aagttcaaga tgcctgatat 4020 
gcatttcaaa gcccccaata tttctatgcc tgatgttgat ctaaatctca aaggacccaa 4080 
aatcaagggg gatgtggatg tgtctgtgcc tgaggtagaa ggtaaacttg aagtaccaga 4140 
tatgaacatc aggggcccca aagttgatgt aaatgccccc gatgtccaag ctccagactg 4200 
gcacctgaag atgcccaaga tgaaaatgcc caagttcagc atgcctggct tcaaagcaga 4260 
ggaccctgaa gtagacgtca acttgcctaa ggctgacgtt gacatctcag gacccaaggt 4320 
ggacattgaa ggccctgatg ttaatattga aggaccagag ggaaagttga aagggcctaa 4380 
gttaaagatg ccagagatga acatcaaagc ccccaagatc tccatgcctg actttgattt 4440 
gcatctgaaa ggtcccaagg tgaagggcga tgtggatgtt tctctgccca aagtggaagg 4500 
tgacctcaag ggccccgaag ttgacatcaa ggggcccaaa gtggatatta atgccccaga 4560 
tgtgggtgtt caaggcccag actggcacct gaagatgccc aaggtgaaaa tgccaaagtt 4620 
cagtatgcct ggcttcaaag gagagggccc agatggggat gtgaagctgc ccaaggctga 4680 
cattgatgtc tcaggaccca aagtggacat tgaaggccct gatgttaaca ttgaaggacc 4740 
gtgg ~~ * 4744 

<210> 62 

<211> 7313 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 256009. 2. dec 

<400> 62 

aaagtggaag gcgacctcaa gggccccgaa gttgacatca gggaccccaa agtggacatt 60 

gatgtcccag atgtggacgt tcaaggccca gactggcacc taaaaatgcc caaagtgaaa 120 

atgcccaagt tcagcatgcc tggcttcaaa ggagagggcc cagatgtgga tgtgaacctg 180 

cccaaggctg acattgatgt ctcaggaccc aaagtggacg ttgatgttcc tgatgtggat 240 

atcgaaggtc cagatgcgaa actgaagggc cccaagttca agatgcctga gatgagcatc 3 00 

aaagccccca agate tccat gectgatatt gacttaaatc tgaaaggacc ctagggtgaa 3 60 

gggagatgtg gctgtttctc tgectaagat ggaaggtgat ctagaggccc ctgaagttga 420 

catcaagggc cccaaagtgg acattgatgc cccagatgtg gatgttcatg gcccagactg 480 

gcacctgaag atgeccaagg tgaaaatgcc caaattcagc atgecaggat ttaaaggaga 540 

gggcccagaa gtggatgtta atttgeccaa agctgacatt gatgtctcag gacccaaagt 600 

ggacattgac actcctgata ttgatattca tggtccagaa gggaaactga agggccccaa 660 

atttaaaatg cctgacctgc acctcaaggc accgaagatc tetatgectg aagttgacct 720 

gaatctgaaa ggtccaaaga tgaagggcga cgtggacgtt tctctgccca aagtggaagg 780 

cgacctcaag ggccctgaag ttgacatcaa gggccccaaa gtggacattg atgtcccaga 840 

tgtggacgtt caaggcccag actggcactt aaaaatgece aaagtgaaaa tgcccaagtt 900 

cagcatgcct ggcttcaaag gagagggccc agatgtggat gtgaacctgc ccaaggctga 960 

ccttgacgtc tcaggaccca aggtggacat tgatgttcct gatgtgaata tcgaaggtcc 1020 

agatgegaaa etaaagggee ctaaattcaa gatgectgag atgaacatca aagcccccaa 1080 

gatctccatg cctgactttg atttgeatet gaaaggtccc aaggtgaagg gtgatgtgga 1140 

tgtttccctt cctaaagtgg aaggtgacct caagggccca gaagttgaca tcaagggccc 1200 

caaagtggac atcgatgccc ctgatgtaga tgttcatggc ccagactggc acctgaagat 1260 

geccaaggtg aaaatgecca aattcagcat gecaggatte aaaggagagg geccagatgt 132 0 

ggatgttacc cttcctaagg ctgacattga gatttctggc cccaaagtgg acattgatgc 1380 

ccctgatgtc agtatcgaag gtccagatgc aaaactcaag ggtccaaagt teaagatgee 1440 

agagatgaac atcaaggccc ccaaaatctc catgcctgac attgacttta acttgaaggg 1500 

tcccaaagtg aaaggtgatg tggatgtctc tctgcccaaa gtggaaggtg atctcaaggg 1560 

ccctgaaatt gacataaaag gccccagttt ggacattgac acacctgatg tcaatattga 1620 

aggtceggaa ggaaaattga aggggeccaa atttaagatg cctgagatga acatcaaagc 1680 

tcccaaaatc tetatgectg actttgattt gcacctgaaa ggtcccaagg tgaagggtga 1740 

tgtggatgtt tcactaccta aggtggaaag tgatctgaaa gggecagagg tagacattga 1800 
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aggtcctgaa gggaagctca aaggtcccaa gtttaagatg cctgatgtac atttcaaaag 1860 
cccacaaatc tccatgagtg acattgattt gaatttgaaa ggacctaaga taaaaggaga 1920 
tatggacatt tccgttccta aactggaggg agatctgaaa ggtcccaaag tggatgtcaa 1980 
aggccctaaa gtgggcattg acactcctga tattgacatt catggtccag aagggaaact 2040 
gaagggcccc aaatttaaaa tgcctgactt acacctcaag gcaccgaaga tctctatgcc 2100 
tgaagttgac ctgaatctga aaggtccaaa ggtgaagggc gacatggaca tttctctgcc 2160 
caaagtggaa ggcgacctca agggccccga agttgacatc agggacccca. aagtggacat 2220 
tgatgtccca gatgtggacg ttcaaggccc agactggcac ctaaaaatgc ccaaagtgaa 2280 
aatgcccaag ttcagcatgc ctggcttcaa aggagagggc ccagatgtgg atgtgaacct 2340 
gcccaaggct gacattgatg tctcaggacc caaagtggac gttgatgttc ctgatgtgaa 2400 
tatcgaaggt ccagatgcga aactaaaggg ccccaagttc aagatgcctg agatgagcat 2460 
caaagccccc aagatctcca tgcctgatat tgacttaaac ctgaaaggac ccaaagtgaa 2 520 
gggcgatgtg gatgttaccc ttcctaaagt ggaaggtgac ctcaagggcc cagaagctga 2580 
catcaagggc ccaaaagtgg acatcaacac ccctgatgtg gatgttcatg gcccagactg 2640 
gcacctgaag atgcccaagg tgaaaatgcc caaattcagc atgcctggct tcaaaggaga 2700 
aggtccagat gtggatgtga acctgcccaa ggctgacatt gatgtctcag gacccaaagt 2760 
ggacgttgat gttcctgatg tgaatatcga aggtccagat gcgaaactaa agggccccaa 2820 
gttcaagatg cctgagatga gcatcaaagc ccccaagatc tccatgcctg atattgactt 2880 
aaacctgaaa ggacccaaag tgaagggcga tgtggatgtt acccttccta aagtggaagg 2940 
tgacctcaag ggcccagaag ctgacatcaa gggcccaaaa gtggacatca acacccctga 3 000 
tgtggatgtt catggcccag actggcacct gaagatgccc aaggtgaaaa tgcccaaatt 3 060 
cagcatgcct ggcttcaaag gagaaggtcc agatgtggat gtgagcctgc ccaaggccga 3120 
catcgatgtc tcgggaccca aggtggacgt tgatattcca gatgtgaata tcgaaggtcc 3180 
agacgcaaaa ctgaagggcc ccaagttcaa gatgcctgaa ataaatatca aagctcccaa 3240 
gatctccata cctgatgttg acctggattt gaaaggaccc aaagtaaaag gagattttga 3300 
tgtgtctgtc cctaaggttg aagggacttt gaaaggccca gaagtagatc ttaaaggtcc 33 60 
acgtctggat ttcgaaggcc ctgatgccaa actcagtggc ccatctttga agatgccatc 3420 
gctggagata tctgctccta aagtaactgc tcctgatgtt gatttgcatc tcaaggcacc 3480 
aaaaattgga ttttcaggtc cgaagttaga aggtggtgaa gtggacctca agggacccaa 3 540 
agttgaagct ccaagcttag atgtacacat ggacagccca gatattaaca tcgaagggcc 3 600 
agatgttaaa atccccaaat ttaagaaacc caagtttgga tttggggcaa aaagccccaa 3 660 
agctgacatc aagtcacctt cactggatgt cactgttcct gaggcagagc tgaaccttga 3720 
gactcctgaa attagtgttg gtggcaaggg caagaaaagt aagtttaaaa tgcctaaaat 3780 
tcatatgagt ggtcctaaga ttaaggccaa aaaaacaggg atttgacctg aatgttcctg 3840 
ggggtgaaat tgatgccagc ctcaaggctc cggatgtaga tgtcaacatc gcagggccgg 3 900 
atgctgcact caaagtcgac gtgaaatcgc ccaaaaccaa gaaaacgatg tttggaaaaa 3 9 60 
tgtacttccc agatgtagag tttgacatta aatcacctaa atttaaagct gaggcccctc 4020 
tccctagccc caaactggag ggtgaactcc aggcacctga tctggaactt tctttgccag 4080. 
cgattcacgt cgaaggtctt gacatcaagg cgaaggctcc caaggtcaag atgccagatg 4140 
tggacatctc agtgccaaaa atagagggtg acctgaaagg ccccaaagtg caggcaaact 4200 
tgggtgcacc tgacatcaac atcgaaggcc tagatgctaa agtcaaaaca ccgtccttcg 4260 
gcatttctgc ccctcaagtc tccatccctg atgtgaatgt aaacttgaaa ggaccaaaga 4320 
taaagggtga tgtccccagc gtgggactgg aaggaccaga tgtagatctg caaggtccag 4380 
aagcaaaaat taagttcccc aagttttcca tgcccaagat cggcatccca ggtgtgaaaa 4440 
tggagggtgg gggagccgag gtccatgccc agctaccctc tcttgaagga gacttgagag 4500 
gaccagatgt taagctcgaa gggcccgatg tttctctaaa ggggccagga gtagacttgc 4560 
cttcagtgaa cctctctatg ccaaaagtct ctgggcctga ccttgatctg aacttgaaag 4620 
gaccaagttt gaagggagac ctggatgcat ctgttcccag catgaaggtg catgctccag 4680 
ggctcaacct cagtggtgtc ggtggcaaaa tgcaggtggg aggagacggt gtgaaagtgc 4740 
cagggatcga tgccacaaca aagcttaacg ttggggcacc agatgtgaca ctgaggggac 4800 
caagcctgca gggagatctg gctgtctctg gtgacatcaa atgccctaaa gtatccgtag 4860 
gage tec tga tctaagcttg gaggcatccg aaggcagcat taaacttccc aaaatgaagc 4920 
tgccccaatt tggcatctct actceggggt ccgacttgca cgtcaatgcc aaggggecac 4980 
aggtttctgg cgaactgaag gggccaggtg tggatgtgaa cctgaaaggg ecteggattt 5040 
cagcaccgaa tgtggacttt aacttggaag gaccaaaagt gaaagggagc ettggggeca 5100 
ctggtgagat caaaggcccc actgteggag gaggtcttcc aggcattggt gttcaaggee 5160 
tagaaggaaa cctccagatg cctggaatta agtcctctgg atgtgatgtg aacctgccag 5220 
gcgtgaatgt gaaactccca actgggcaga tttctgggcc tgaaatcaaa ggtggtctga 5280 
aaggttcaga agtaggtttc catggggctg ctcctgatat cagtgtgaag gggcctgcct 5340 
ttaatatggc atctcctgag tcagattttg gcatcaactt gaagggecca aaaatcaaag 5400 
gaggtgcgga tgtttcaggg ggtgtcagtg ccccagacat cagccttggt gaagggcatt 5460 
tgagtgttaa aggttccggg ggtgagtgga agggacccca agtctcctct gctctcaact 5520 
tggacacatc taagtttggc tggggggect tcatttctca ggaccaaagg tggaaggagg 5580 
tgtgaaagga ggtcagattg gactccaggc tcctgggctg agtgtgtctg ggectcaagg 5640 
tcacttggaa agtggatctg gaaaagtaac attccctaaa atgaagatcc ccaaatttac 5700 
tttctctggc cgtgagctgg ttggcagaga aatgggggtg gatgttcact tccctaaagc 57 60 
agaggecage atccaagctg gtgctggaga cggcgagtgg gaagagtctg aagtcaaact 5820 
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gaaaaagtcc aagatcaaaa tgcccaagtt taatttttcc aaacctaaag ggaaaggtgg 5880 
tgtcactggc tcaccagaag catcaatttc tgggtccaaa ggtgacctga aaagttcaaa 5940 
ggccagcctg ggctctctgg aaggagaggc agaggccgaa gcctcttcac cgaaaggcaa 6000 
attctcctta tttaaaagta agaagccacg ggcaccgctc aaattcattc agtgatgaaa 6060 
gagagttctc tggaccttcc accccgacgg ggacgctgga gtttgaaggt ggggaagtgt 612 0 
ctctggaagg tgggaaagtt aaagggaaac acgggaagct gaaattcggt acctttggtg 6180 
gattggggtc aaagagcaaa ggtcattatg aggtgactgg gagcgatgat gagacaggca 6240 
agttacaggg gagtggggtg tccctggcct ctaagaagtc ccgactgtcc tcctcttcta 63 00 
gcaatgacag tgggaataag gttggcatcc agcttcccga ggtggagctg tcagtttcca 63 60 
caaagaaaga gtagcaggcc tttgtatgtg tgtacatata tatatatata acaaaacatc 6420 
agccttgggt ggtgtgttcc tatataaact ccaaagggaa acacaccgac tgcctcagca 6480 
atcatgcaaa gaccttgcct ggcccggtgg caagcgctga aaaaccgacc gcctgtaggc 6540 
tcctggaact atacagatag gtaaagagtt ccaagttcgt ccagcccatg tgcaaagtca 6600 
acagtatttg ccttaagatt tcatatatat atattttttt gcattgactg ctgagagctc 6660 
ctgtttacta agcaagcttt tgtgtttatt atcctcattt ttactgaaca ttgttagttt 6720 
tggggtaatg gaaacccact ttttcattgt aatgactttg ggggcttttg ttagtaaggg 6780 
tgggtggggt gatgggttgc agacggaggt caggtcttcc tctttcctga gactggatct 6840 
gttcaaacag caaacgccca cagatggccc agaggtggtg gtagtcaggg tgtgtgggtg 69 00 
tttttagggt tctttagtgt tgtttctttc acccaggggt ggtggtccca gccagtttgg 69 60 
tgctgacggt gagaggaaat tagaatctgt ttgcaaattg tccaacccac cccctcaaca 7020 
tgaggggctt ccattttctg tgttttgtaa gggaactgtt tccttcatgc cgccatgttc 7080 
ctgatattag ttctgatttc tttttaacaa atgttatcat gattaagaaa atttccagca 7140 
ctttaatggc caattaactg agaatgtaag aaaattgatg ctgtacaagg caaataaagc 7200 
tgtttattaa ccttgtacag catcaatttt cttccatttt ctgtgttatg taagggaact 7260 
atttccttca tgccgccatg ttcctgatat tagtactgat ttctttttaa caa 7313 

<210> 63 
<211> 1602 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 23 1892 . 12 . dec 
<220> 

<221> unsure 
<222> 118 

<223> a, t, c, g, or other 
<400> 63 

gccggccgag ggcagggctc 
cgcgagcggc gccagcttcg 
ggcgacggcg acgacgacga 
cttttcctcc ctccc ttacg 
gcttcctcca gacctctcgg 
ctgtaaccca aaggaggaaa 
tccttgaagc ttcaggcaag 
cgagtcttca ttggaaacct 
ttctctaagt atggccgtgt 
tactccaatg agcgccatgc 
gggcagaccc tggacatcaa 
aagagagcag catctgccat 
ccagtgccca gggcggtccc 
gtcaaaacta acgtacctgt 
gccaagatca agttaaagag 
aagtccaata tcgatgccct 
aatccagatg gcaagaagaa 
gcagcggtgg cggtggcagt 
cccaagagaa cacaacttct 
acggcgatga ggaagggctc 
cagacgcgga tgatggggcc 
gtgaagggca tcgctgcccc 
agcgggtacc agaggaaagc 
gccatgtcct ctgcaggtgg 
gcactgccca ggccagaggg 
gacactccca ggcttgggtt 



gcggttgcgg ggctcgcgcc 
gggcagcgga acccagagaa 
ctcccgcgcg tgtgcccagc 
tccccgagtg cggcagtacc 
cgcgggtgag ccctattccc 
cagctggcta agctcatcat 
caatgtaacc aacaagaatg 
caacacagct ctggtgaaga 
ggccggctgt tctgtgcaca 
ccgggcagct gtgctgggag 
catggctgga gagcctaagc 
atacaggctc ttcgactacc 
tgtgaagcga ccccgggtca 
caagctcttt gcccgctcca 
cagtgagctg caggccatca 
get gage cgc ttggagcaga 
gggtgatgga ggtggcgccg 
ggtggtggcg gtggcggtgg 
gaggcaggee tgccccaggg 
ctgacacaca gcgaggaaga 
ttgcagtaag cagcctgaca 
aggectcaag ccgggcaccc 
tggcagcagg cgcctcctcc 
agttactggc ctactccttc 
tagagcacag gggtttcccc 
ttttctatag gtttggcggg 



gctgtcagtg eggeggggeg 60 
gctgaggggg eggtagenge 120 
ctcttcccgc cgcagccgcc 180 
gcctccttcc cagccgcgcg 240 
agaggcaggt ggtgctgacc 3 00 
tgttactggt gggcaccatg 3 60 
accccaagtc catcaactct 42 0 
aatcagatgt ggagaccatc 480 
agggctatgc ctttgttcag 540 
agaatgggcg ggtgctggcc 600 
ctgacagacc caaggggcta 660 
ggggcegtet gtcgcccgtg 720 
cagtcccttt ggtccggcgt 780 
cagctgtcac caccagctca 840 
agaeggaget gacacagatc 900 
tegctgegga geaaaaggee 960 
geggggegge ggtggtggtg 1020 
cagcagccgg ccaccagccc 1080 
ggaagcaegg acccgagacg 1140 
gctggaacac agecaggaca 1200 
ggagcaatgg ccaccagcag 1260 
aaccctggat gccacccccc 1320 
cccaacgcat cccagccagt 13 80 
cccatgagcc ctccc tgtct 1440 
atactacctc ccctccccag 1500 
gggecacagg gaggggaccc 1560 
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tgacaataaa gagattggat cccaaaaaaa aaaaaaaagc gg 1602 

<210> 64 

<211> 2718 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 197445. l.oct 

<400> 64 

gcggccgccg ccgccgaggc ttacccggga atgtctgggc ccgcgcctcg cggcccccaa 60 
gctccacgct gcgcccgctg tcccggcctc taaaggccgc cacgtccctg cggcgcgcgc 120 
aggcagaaag cggcttcgtg ccggcggagg gggcccgggc gggccgggag gggctgcccc 180 
aggccctgcg cctaccccat caccgcggcc ggcgccgggc cgggaggatg cgcggtgtgg 240 
ggctctgaag catggagggg gtgttgtaca agtggaccaa ctatctcaca ggctggcagc 3 00 
ctcgttggtt tgttttagat aatggaatct tatcctacta tgattcacaa gatgatgttt 3 60 
gcaaagggag caaaggaagc ataaagatgg cagtttgtga aattaaagtt cattcagcag 420 
acaacacaag aatggaatta atcattcctg gagagcagca tttctacatg aaggcagtga 480 
atgcagctga aagacagagg tggctggtcg ctctggggag ctccaaagca tgtttgactg 540 
atacaaggac taaaaaagaa aaagaaataa gtgaaaccag tgaatcgctg aaaaccaaaa 600 
tgtctgaact tcgcctctac tgtgacctct taatgcagca agttcataca atacaggaat 660 
ttgttcacca tgatgagaat cattcatctc ctagtgcaga gaacatgaat gaagcctctt 720 
ctctgcttag tgccacgtgt aacacattca tcacaacgct tgaggaatgt gtgaagatag 780 
ccaatgccaa gtttaaacct gagatgtttc aactgcacca tccggatccc ttagtttctc 840 
ctgtgtcacc ttctcctgtt caaatgatga agcgttctgt cagccaccct ggttcttgca 900 
gttcagagag gagtagccac tctataaaag aaccagtatc tacacttcac cgactctccc 9 60 
agcgacgccg aagaacctac tcagatacag attcttgtag tgatattcct cttgaagacc 1020 
cagatagacc tgttcactgt tcaaaaaata cacttaatgg agatttggca tcagcaacca 1080 
ttcctgaaga aagcagactt atggccaaaa aacaatctga atcagaagat actcttccat 1140 
ccttctcttc ctgaagaaac tgaagtgtcc aacttcctct aagtattgct atgcaaaagc 1200 
tgctgtaatt aaactattgt tatagggagt agttttttcc cttaggactc tgcactttat 1260 
agaatgttgt aaaacagaca aacaagaaaa caaaccacat acttttgaag tgtattttat 1320 
ctttatatag tttgtttgca agagtatttt cctaataact tcacagtatg aatgtgcatc 1380 
tttttttttt gaacaaatga tggtgtaaca ttttgacatc cataaggaca aatgtagata 1440 
tttttcttaa aaactctgag gggactgaca gcatggtcag ggtgtattgt agcttataaa 1500 
catgaaatct tataaggttt cagtttgaca gaagtgtgat atatgtaact tgtgccatgg 1560; 
accaaatggt cactttacca cagctaaaaa tgagttacga tagcagcttg atggtgattg 1620 
tattgtattc ctttaatcaa aaaggaaaca caatattcta agtatcttta gcccaaatac 1680 
catgacatat tgagcatctt taaataacca gactgtattg tccttcatat gtgaagttga 1740 
cactactgat ttgtcaatac caaattttgg gttaaagtgt ttaattttta tgtatttatt 1800 
ttcttgttgc ctcaaaagat gattgcattc taacttttgt gacctaccaa atttaagatg 1860 
tgtatacgtt gttctttacg ttgttctaga aaagagattt taatgctgta gtgactttgc 1920 
tcacttacac tagagaaata aacaactttc aatggaagag aattttagtg cttttttttt 1980 
cctaaaatag atattaagct gctgttgtaa agtattgttt gcagctcttt ccaatatcta 2040 
gagacatttt tatttatgaa tatttataca aaaaggaatt ctgtcaagat gactgctcta 2100 
tatcacttga gaatggcatt atttaattaa agaacaaata gcattttttg gtagtgcctg 2160 
tccataccta ttgtcattgt ttgccttgta atctgttttt ttgaattcat tttgggctga 2220 
tagttttgtt taaggttttg gataaggagc actttaaaac aaactggtgt gttgttttta 2280 
agttaatcat atgtttaata aatgcgtggt ttttgcattc aaacacatca tataatacat 2340 
tgtatttttt acattcattg atattctgtc taatctttat taggcactaa tataattcta 2400 
atggatttga gtttgtttgt atatttaggc ctgctggtga agacacaaat gaagcataat 2460 
ttttgttgcc tgtgagctta gaatggtgtg acctatttca tttgatttca atttatcttg 2520 
taagtttatg tgggatattt taaaataatt aatatttggt atttgtttaa aaaactgtaa 2580 
ttataggcct tccatcttaa gtttaagatg gacataaaca gtctatacat taatatgtgt 2640 
ttgggtaaac tgtatgctac ttgaagtgtt agtaatttta aataacacac aggtcatcag 2700 
actactctat atacaatg 2718 

<210> 65 

<211> 1601 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 348775. l.oct 
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<220> 

<221> unsure 

<222> 631, 633, 635, 1339, 1343, 1353 
<223> a, t, c, g, or other 

<400> 65 

ttcaccctgt acttttatgt tgcagagatt gcttctttcc ttaaacctca tgaaccaacc 60 
tctaacagct tcaaattttt cttctgcagc ttctttacct ctcagtgttc acagaattga 120 
ggagagtcaa atccagaggt ctttctctgg attagacttt ggcttaagga aatattgtga 180 
ctggtttgat cttctatcca gaccactaaa atttttccat ataagcaata agactgtttt 240 
gctttcttac tcatgtgttc actgggagta gtacttctaa tttctttcaa gaacacttcg 3 00 
tttgcattca caacttggct aagtgtttgt tgcatgaggt ctagctactg gcctgtcttg 3 60 
cttacagcat gccttcctca ctaaggctta attatttctt ccttttggtt taaagtgaca 42 0 
gacatgcaac tcttctttca cttgaacata tagaggctat agtagggtta ttaattggcc 480 
acattttaat gttaataaaa ggaagcctga gaaaaagaga aagagaaatg gcccgttggt 540 
tgggcagtca gaacaaacgc atttgtcaat tgtttgctgt cttatcctgg tgtgatttgt 600 
ggttcccaaa acaatgacaa cagtagcatt nangntcact gattacagat caccacaaca 660 
gattcaataa taaaaatctt aaaatactgt gagaatgacc gaaatgtgac acagagacgt 72 0 
gaagtgagca cgtgctgtag gaacaatggg tgcccagtga gacctgctta ttgcagggtg 780 
gccacaaacc ttcaatacgt aaaacacatg gtcacaaaac acaataaagc aaagtgcagt 840 
gaaacaagat gtgtctgtct tttgatagac tctgacaatc tctacctttg aattggtaca 9 00 
ttcataccat taacattcaa agtgattatt gatatcattg gattaatatc tactatattt 960 
gttactgttt tctattcatt ctcctcagtc ttcattcttt tgtctaccac tctttttctg 1020 
ccttttgcag ttttcattga tgattttaga tgactccatt ttccctgtct ttcttagtac 1080 
atacttctct ttttaaaact tttttttaac tagttgccac agaatttgca atatacattt 1140 
acaaccaatt caagtccact ttcaaataac actatcccac tatcccacaa ataggactac 1200 
ctgcttaaca aacaaaacac ctaattcctc aatatacatt tacaaccaat tcaagtccac 1260 
tttcaaataa cactatccca ctatcccaca aataagacta cctgcttaac aaagaaaaca 1320 
cctgattcct ccctcccanc ctnccattcc atnccttgta ttattgttcc ttatttccct 13 80 
tgtgtataag catacataat ctatctgtgt gtatttattg ttatctacaa acttattggt 1440 
cagatcaatt atgaataaat acatgttttt attttaccac aattcctccc tcccatcctt 1500 
ccattccatt ccttgtatta gtgttactca tttcacttgt gtataagcat acataatcta 1560 
tctgtgtgta tttgttattg tctgtgaact tcttggtcag a 1601 

<210> 66 

<211> 2606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3 3 623 9. 5. dec 
<400> 66 

gctgcacttc ccaggcccca ccagccgcgg ctccggctcg tagcccacag cccactgccg 60 
gcggctgggc gctgccgagg ctcggggcgc gcgcagttgg cgtctgccag tgccaagact 12 0 
gtgccgcccc cacagccgag gcgcgaaagg gggacgcccg gcctctgggc cgctgccttc 180 
gctttctctt cgttgttgcg aacgccgtcc gctcaggagg cgccccgcga ccggcgcgat 240 
gagtgccaac gaggaccagg agatggaact agaagcatta cgctctattt atgaaggaga 3 00 
tgaaagtttc cgggaattaa gtccagtttc ttttcaatat aggataggtg aaaatggtga 3 60 
tcccaaagcc ttcttaatag agatttcctg gacagaaaca tatccccaaa cacctccaat 420 
tctatctatg aacgcttttt ttaacaacac catatcatca gctgtaaagc agagtatatt 480 
agccaagcta caggaagcag tagaagctaa tcttggaacc gctatgacct atacattgtt 540 
tgaatatgcc aaagacaata aagagcagtt catggagaat cacaatccca tcaattccgc 600 
aacatcgata agcaatatca tctcaattga aactcctaat acagccccat caagtaagaa 660 
aaaagacaaa aaagaacaac tttcaaaagc ccagaagcgt aagctggcag acaaaacaga 720 
tcacaaagga gaacttcctc gaggctggaa ctgggttgat gttgtgaagc atttaagcaa 780 
aactggctct aaggatgatg agtagcactt ggaatttgag acaaggaaag agcattcttt 840 
aaagagtaaa actgggttca aaatctttca ttactatttt ctggtattga ggcgactttt 900 
tataaaacac aattttttgt atgtttctta cattaaaaag gttgtaagtt gaaagttcat 9 60 
gaagagatct tgttgtatta aattattttc acaaacttgc cttaataaaa ggtgaaaatg 1020 
ttactgttta gtatacttta tgaagcccct tgagctttat aaatggacag gcatggggaa 1080 
taagaatcag tgttaattta aatgatctta tcctggtgga tgtgctattt tcttaaagga 1140 
gtatgaagcc cttttcaaac tatcatccca gtggagcgga gtactcagtg aacagttact 1200 
ccatagtgca atccatatta ataggcttct tctcttaagt cttcatctct tcttttgctt 1260 
aattactgaa ccgtaaatta cttcagagaa atttaaatgc tggtatttga actttataca 132 0 
tgatactttt tgtagtttct tttaattttt gaaagatgaa ctgcttcctt ttaataaatt 1380 
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aatatctatt tatacttttc tcttgatttg 
gtggtatgtg gaataggaga atataaaaac 
agtaagaaat gctggccctt tcttaaaaca 
aagatgcctt aatatttagt ttttgtattg 
ttatctgctg ttgtgtgctt ttagttgttg 
taaaaccaca tttcaagtct tgtttctttt 
ttctcaacct gagcatcttt ttaatcatat 
acacatgctt gatttaaaaa taataataat 
aaaagtttct gatgtttatg tgtgatgtac 
aacacataaa actattacgt ctggtaggaa 
agcttgggtc tgagttttga tagaacagta 
ctgtatccgc ggtttttact cgttaactga 
agtataatta cagcaaagga gttaatctca 
gaattaatgt agagcaaaac atgttaaaat 
ttcagggttg tgtgtgggta gtatttgtgg 
aacatctgct agtggaataa aatactttgt 
cttattatgg ctcatagatt acagagaatg 
actcttttta tatgaaatgt aaaaatttgt 
taccttatgt aaaacacttg aacagcctca 
cagtatattt tctcaatgtc tgtttactta 
aataaagtct gaaattatac ccagtg 

<210> 67 

<211> 1502 

<212> DNA 

<213> Homo sapiens 



ggtcaagatg tttgatcatg agtgctttga 1440 
aaatctgcca aatacactag aaagcatttt 1500 
tttctcttgc atataccagg atgggagtaa 1560 
ttggagacat tgattttaat aaaatcctat 1620 
gataactgag gtctcctaaa tggttcaaca 1680 
tggagtgtct tttcaagtat tcaaatgtat 1740 
acatgggagt cttttaaatg ctgaactgtt 1800 
agaggaaact attggtctag ttgtgccaag 1860 
agtgattttg tatatgcgcc cagctttaag 192 0 
gattgttagt gcctcaagtt acacctgtgc 1980 
aacatttaaa gaagttaaga gcagtttgag 2 040 
cttcagctaa atagtttgaa ttatagagta 2106 
ttttcaaagc tgtttctcat tttatttctt 2160 
tcaggaccac tggaatatgg caacttatgt 2220 
ttgtattggt ttgttttttg tttttggaga 2280 
tttgctctga agagactgaa attgttcagg 2340 
atgctagtta catgccaatg aactattttt 2400 
aggggttctg gtgatggtgg tacctcttat 2460 
tcaatattgc cgtcatctgt ttaacactcc 2520 
aaattttgtg gagtgacata attaataagc 2580 

2606 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 215660. 4. dec 



<400> 67 

cactcgacta ccaagatggc ggcccccggg 
gggtggtcgc gggcgatgcg gggctgccag 
gacctactaa gtgctcggct cttgtcccaa 
tttgagactg tgtcggaaga ggagaagggg 
gctaagaagt atgatgtgat gaatgatatg 
gatttgctgc tctggaagat gcacccgctt 
ggcacaggtg acattgcatt ccggttcctt 
cagaagaggc agttaagggc ccaacaaaat 
cagaatgaag aagattcctt gggcgggtct 
atgctaaagg ttggaaagca gaaagccttg 
gtattaggag atgctgaaga actgcccttt 
gcctttggga tccggaatgt cacacacatt 
ctgaaaccag gaggacggtt tctctgtctg 
tccaggcttt atgatctata tagcttccag 
ggagactgga agtcctatca gtaccttgta 
gagttcaagg acatgataga agatgcaggc 
tcaggcattg tggccattca ttctggcttc 
tgaaccagtc atatcctgtt gaaagcctgg 
cagcagagca tctcctctta aggatacgtg 
caaagtggaa gaacaaattc ttgtcacttt 
catctcccag aggcatttgg tctgtatctt 
ggtgtgtggt taaggtacaa ccacccctaa 
cttctctgct ggtgctgcct tctagaggga 
ttaaccagaa aatttaattg tacctgaatc 
caagtcagag tattaatgat catgagaatt 
tc 



agctgtgccc tatggagcta ttgcggccgt 60 
ctcctcgggc ttcgtagctc ttggcccggg 120 
gagaagcggg cagcggaaac gcactttggg 180 
ggcaaagtct atcaggtgtt tgaaagtgtg 240 
atgagtcttg gtatccatcg tgtttggaag 3 00 
cctgggaccc agctgcttga tgttgctgga 3 60 
aattatgttc agtcccagca tcagagaaaa 420 
ttatcctggg aagaaattgc caaagagtac 480 
cgtgtcgtgg tgtgtgacat caacaaggag 540 
gctcaaggat acagagctgg acttgcatgg 600 
gatgatgaca agtttgatat ttacaccatt 660 
gatcaggcac tccaggaagc tcatcgggtg 720 
gaatttagcc aagtgaacaa tcccctcata 780 
gtcatccctg tcctgggaga ggtcatcgct 840 
gagagtatcc gaaggtttcc gtctcaggaa 900 
tttcacaagg tgacttacga aagtctaaca 9 60 
aaactttaat tcctttccta tcatggagca 1020 
aactgaagga taatctggca aatgagacag 1080 
ccttggactc atgtttgaat cgaacagtct 1140 
tttacagctt tctttggagc tgcttcagtc 1200 
tgctcaactg ctaatttctc ttggctgtag 1260 
agctcagttt tgaagtgagt gtatttatag 1320 
tgatagatca tttgaaccca gtgacaattt 1380 
aacctttcag cctaggacga agtctaggcc 1440 
gtgtgctgaa ccagtaaacg agtttacctt 1500 

1502 



<210> 68 

<211> 3349 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> incyte ID No: 3 9194 0 . 2 -dec 
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<220> 

<221> unsure 
<222> 2986 

<223> a, t, c, g, or other 
<400> 68 

atttatcaag actttttatg agaataggta aaccaagcaa taacttccta ggactgaatc 60 
accaccccag aagagcgaga ggctcctttc atatgcctga ggccacaccc cttaacctgt 120 
tctgacaaaa tagtggctgg cccatgtacc agctccattc agaaattcag gaagagaaaa 180 
gacagccctg ttgtcacaca aaccggtgtg ggggagggtg gagcctggtc tgcacggcag 240 
tcctggtggc ccctgtggag gacaggcagg gctggcagca tagcctttgt tgccacacaa 3 00 
• ccggaatttg ctcccccagg actgtgggag ccagtgtccc agctgaaatc tttttagtgt 3 60 
gtggctctga atggcactca cattccattt tggctcacat gaaactaact gaagcccttt 420 
gttcaagctt caggctctta ggcatggaaa tgagaatgtg actgtggctg tcttacagga 480 
aaattcttgt ttgtccctga atgagagcac agaggcattg aattcacaga gctgcaaact 540 
tgcctgataa atgagggagt ggcagtttat agataggtca tcttttttcc ttcctccagg 600 
tgtccttgcc tttcttccca aagtcattca tttctgatga gtatatgaat ccccctcttg 660 
ctagtaaggt tctatttggg ctaaaacaag gctgaatttt taaagagtat ttgaatatat 720 
tttagaatca aattgaggct ataaattgca tcaatctgga caattccatt gcaggaataa 780 
tatgttaaaa accaatgggg agaagcaccc acatctctcc tgtagcactc cgtgtctcat 840 
aagcaatttg aagacactta caagtaactg attccagtca aattaggatt aactgactca 9 00 
aaaaatggtg tcaagtttct ttaatgtttt tatgttagaa gtgagtttaa cagacttgaa 9 60 
gaaaactgtt atcttttcct gctgtgagtt tacacaaatg attccagagc agaatgaaag 1020 
cagaaagctg ttggttacaa tattctttta acctctctgc agcattttac acttactggg 1080 
aaccttatga ttcaccgtaa gagtggaaat atacctgagt tcgtgtccta atggtctcta 1140 
attcacattg gatcgtgggc aaatcacctc acctctctga gcctgttccc tcctcttaga 1200 
ccatctctaa gaccacttca tctatttaca catcatttgc ttgaacattg ctgaacatct 1260 
gcgtgaactt ggcctctcca gcccttgcag gtggaaacag ctgtgtcaag gctcaaggct 1320 
cacgctgagg ggacttggag ggagggggct tctgcattaa gctttcctgg tgaagaccct 1380 
tgatcttgtc caaagccctg tgtctttgac tggcttctct tcagagtccc cgttgtcatc 1440 
gtaagaccct tgctgtttgg agggtggtct tgtgactgtg gcagctgctg gccgctggaa 1500 
tgaggagcct atctccatcc tccagtgtga ctcaggcaga gcattgagaa ttcccagggg 1560 
cagaaatcct tcctgctcag gctttcattc taaaactaca gtcttcatta aagctgaact 162 0. 
ttctgggtag ctgagcttat atgcccggca tctgaatgag agctctcttt gtaactgtgt 1680 
gacttgagat ctagtttgcc agctcctggg aaacaataca tgtgttcttg tttgtgtttg 1740 
ctcagcaagc agatgtctga gatgtaagaa gcttttcttt tcctgtggca ttgattctga 1800 
cttagagctg aagtaaagat cactgaaaca tcacgtcaag ttgaagtcac tcataggtct 1860 
ttgtccttta ggcaggacag gagagtcatt aagaagcatt tcactgtagc attctatcac 192 0. 
aatatcatct ggaattgttt tctttgccca gaaagcctta acttgcctct agagaatccc 198 0 
tggtattaca acgatattgc ggcattagaa ttccaactct tctgctgtgg aagtttgaag 2040 
cgaagctgca gcaaaaccag agaatttcct caagtggcct gtaggctcct tgttatctta 2100 
tgcccccacc cctccctcaa caatatgagt gatccagaac tggcccaaac acctcagctc 2160 
tggtcccttt ttgcccttct tggccttact ctgttgttca aagccacttt ggattgcttg 2220 
gatgcttcga acagccatga aaagtagcct gcctgtggca tttagaggcc aagcaattga 2280 
cagaaagggt ttcttctacc tctgttatct aagcagaggg aagtaaactt ctcaccgccc 2340 
cccacccctc actgcccccg attacactag aattgctttc gccaaattgt agttgaagct 2400 
aaggaagggg aatctggccc ctgctgggag agggaactgg aatgccacac aaggcaaggc 2460 
ctgcttcctt ccttcccctc tgctgctgct gcctcggaac gctgcagccc aggcttcctc 2520 
ccacagtggc ccttggaagc aggccgcaga gtagacagct gctccttttg gaagagtcag 2580 
tcccctgtgt tttctgaact gtttttccta gcatgtatgt gggtagagct ttcatgcatc 2640 
tctagtaata ataagctgaa attagttttt tttttaattc tccaatttaa aacttttaat 2700 
taaaaagtaa attttaatgt cgaaaatgca aacttgggga gggcagaaag atcacacaca 2760 
aggctgtcac ttcacacttg gaggattgca cagcagccgg gcaaaggctc tcctcacttc 2820 
ccagatgggg gcgggcgggg cagcagagac gcacctcact ccttagacag tgcggcagcc 2880 
aggcagaggg ggctcctcat atcccagacg atgggcggcc caggcagaga cgctcctcac 2940 
ttccccagat ggggcggctc ccgggaagcg gggctccctc acttcnccag acagggtggc 3 000 
caggcagagg tgctcctcac ttcccagaac aattctttat gaatttgata aaggactgaa 3 060 
gtgcaactga aagctgctag tgatgatctg gtaatataca atttgtccag tagccagttt 3120 
gtttttattg tgttttctaa ccataagaga tcattaaagg caaagcctgt atgacgctgt 3180 
acacacacaa aaaaatggtc accgcaggcc atactaccaa tgaaatggta ggtaaacaaa 3240 
tcttctggtc aagagaaaaa aaaaagaaat agcactctgc atgctttgct ctacaagatg 3300 
aatttcccta gaaagaatcc aatgaaggcc gggcatagtg gctcactcc 3349 

<210> 69 

<211> 2599 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 978302. 3. dec 

<400> 69 

gggaaccctg ggagcttccg cgcctgccca gttttgctcc gaaagactta ccgaggaggg 60 

agcttgcggt gcgttctggg aaagttgctg gggccagctc ctttgtttcc agtctgagcg 120 

ttgcgttcgg tttcccgagg gtcttctgag gcaccgcggc tgcgggcttc tgagttcccg 180 

gctctccgca gggaagcctc ctcttcgtac ctcgtttttt ggctcgtggg gggtcctccc 240 

accgctggcc gacgcagcca gcatgtccgg ggtgcgcgca gtgcggatca gcatcgaatc 3 00 

ggcctgcgag aagcaggtcc atgaggtggg cctggatggc accgagacgt acctgccccc 3 60 

gctgtccatg tcgcagaatc tggcgcgtct ggcccagcgg atagacttca gccagggttc 420 

gggctccgaa gaagaggagg cggcggggac cgagggcgac gcgcaggagt ggccgggcgc 480 

cgggtccagc gcagaccagg acgacgagga aggagtggta aaatttcagc cttccctttg 540 

gccttgggac tcagtgagga acaatttgag aagtgccctg acagagatgt gtgttctcta 600 

tgatgttctc agtattgtta gggataaaaa atttatgact cttgatcctg tctctcagga 660 

tgcacttcct ccaaaacaga atcctcagac gttgcaattg atatctaaaa agaagtcact 720 

tgctggagca gcacaaatct tattgaaggg ggcagaaaga ctgactaaat cagttaccga 780 

aaaccaagaa aacaagctac aaagagactt caattctgag cttttgcgat tacggcaaca 840 

ctggaaactt cgaaaagttg gagataaaat tctcggagat ctgagctaca gaagtgcagg 9 00 

atctctcttt cctcatcatg gtacatttga agtaataaag aatacagatc tcgatctgga 960 

taaaaagata cctgaagatt actgtcctct tgatgtccaa attcctagtg atttagaggg 1020 

gtctgcatat atcaaggttt caatacaaaa acaggctcca gatataggtg acctcggcac 1080 

agttaacctc ttcaaacgac ctttgcccaa atccaaacca ggttccccac attggcagac 1140 

aaaattagaa gcggcacaga atgttctctt atgtaaagaa atttttgcac agctctctcg 1200 

ggaagctgtt caaattaaat cacaagtccc tcacattgtg gtgaaaaacc agattatctc 1260 

tcagcccttt ccgagcttgc agttatctat ttctttgtgc cattcctcaa atgataagaa 1320 

atcccaaaaa tttgctactg agaagcaatg tccggaggac cacctttatg tcctagagca 13 80 

taatttgcat ctactgatta gagagtttca taaacagacc ttgagttcca tcatgatgcc 1440 

tcatccagca agtgcacctt ttggccacaa gagaatgaga ctttcgggtc ctcaagcttt 1500 

tgataaaaat gaaattaatt cattacagtc cagtgaaggg cttctggaaa aaataattaa 1560 

acaagcaaag catatttttc taaggagtag agctgctgca accattgaca gcttagcaag 1620 

ccgaattgag gatcctcaga tacaggctca ttggtcaaat atcaatgatg tttatgaatc 1680 

tagtgtgaaa gttttaatca catcacaagg ctatgaacaa atatgcaagt ccattcaact 1740' 

gcaattgaat attggagttg agcagattcg agttgtacat agagatggaa gagtaattac 1800 

actgtcttat caggagcagg agctacagga ttttcttctg tctcagatgt cacagcacca 1860 

ggtacatgca gttcagcaac tcgccaaggt tatgggctgg caagtactga gcttcagtaa 1920 

tcatgtggga cttggaccta tagagagcat tggtaatgca tctgccatca cggtggcctc 1980. 

cccaagtggt gactatgcta tttcagttcg taatggacct gaaagtggca gcaagattat 2040 ; 

ggttcagttt cctcgtaacc aatgtaaaga ccttccaaaa agtgatgttt tacaagataa 2100 

caaatggagt catcttcgtg ggccattcaa agaagttcag tggaataaaa tggaaggtcg 2160 

aaattttgtt tataaaatgg agctgcttat gtctgcactt agcccttgtc tactatgatt 2220 

ttttccagat gtttcctaaa gaagtttcca gaaactttga cttgaaatgt ttgcagatca 2280 

actataagca caaagaagag ataacttcca aaagagtgct gtttttaaaa ataataatta 2340 

ggaaatgttt atttagcact ttcaaacttt tcactttata aatgacaagt gctttgaaat 2400 

gcagaagttt atgtacagtt gtatatacag tatgacaaga tgtaaaataa tatgtttttc 2460 

atgcagttta aaatattact aacttaaggg tttctatgtg ctttttaaaa tattccttct 2520 

ttgatgttga catcaaataa agtatgtggt ttaaaaaaat ctccaaatac ctttttttcc 2580 

ccccaaatac tttctaaac 2599 

<210> 70 

<211> 2085 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 228629 . 11 . dec 
<400> 70 

gggcggggtg cttagggtgc aggaggcgcg cgcctagcgg cggagtgtgg cgtgaggccg 60 
ggcccgcgcc gccatgaacc tagagcggct gcggaagcgc gtccggcagt acctcgacca 120 
gcaacagtat caaagtgctc tattttgggc agataaagta gcttcactct ctcgtgaaga 180 
accccaggac atctattggt tggctcaatg tctttacctg acagcacaat atcacagagc 240 
cgcccatgca cttcggtcac gaaaactgga caaattgtat gaagcatgtc gttaccttgc 3 00 
agctaggtgc cattatgctg caaaagagca ccagcaggcc cttgatgttc ttgacatgga 3 60 
agagcccatc aataaaagat tatttgaaaa atacttgaag gatgaaagtg gcttcaaaga 420 



48/50 



WO 01/21836 



PCT/US00/25643 



tccctccagc gactgggaaa tgtcacagtc ttcaataaag agttctatct gtcttctacg 480 
cgggaaaatc tatgatgctc tagataaccg aaccctgcta cctacagcta caaagaagct 540 
ttgaagcttg atgtctactg ttttgaagcg ttcgatcttt taacatcaca tcacatgctg 600 
acagcacaag aagaaaaaga acttcttgaa tcactacccc ttagcaagct gtgtaatgaa 660 
gaacaggaat tgctgcgttt tctatttgag aacaaattga aaaaatataa taagcctagt 72 0 
gaaacggtca tccctgaatc tgtagatggc ttgcaagaga atctggatgt ggtagtgtct 780 
ttagctgaga gacattatta taactgtgat tttaaaatgt gctacaagct tacttctgta 840 
gtaatggaga aagatccttt ccatgcaagt tgtttacctg tacatatagg gacgcttgta 900 
gagctgaata aagccaatga acttttctat ctttctcata aactggtgga tttatatcct 960 
agtaatcctg tgtcttggtt tgcagtggga tgttactatc tcatggtcgg tcataaaaat 1020 
gaacatgcca gaagatatct cagcaaagcc acaacacttg agaaaaccta tggacctgca 1080 
tggatagcct atggacattc atttgcggtg gagagtgagc acgaccaagc gatggctgct 1140 
tacttcacag cagcacagct gatgaaaggg tgtcatttgc ctatgctgta tattggatta 1200 
gaatatggtt tgaccaataa ctcaaaacta gctgaaaggt tcttcagcca agctctgagc 1260 
attgcaccgg aagacccttt tgttatgcat gaggtcggcg tggttgcatt tcagaatgga 1320 
gaatggaaaa cagccgaaaa atggtttctt gatgctttgg aaaaaattaa agcaattggg 1380 
aacgaggtaa cagttgacaa atgggaacct ttgttgaaca acttggggca tgtctgcaga 1440 
aaacttaaaa agtatgctga ggccttggat taccaccgtc aggcactggt gttgattcct 1500 
cagaacgcat ccacctactc tgctattgga tatatccaca gtctgatggg caactttgaa 1560 
aatgctgtgg actacttcca cacagccctt ggtcttaggc gagatgatac attttctgtt 1620 
acaatgcttg gtcattgcat cgaaatgtac attggtgatt ctgaagctta tattggagca 1680 
gacattaaag acaaattaaa atgttatgac tttgatgtgc atacaatgaa gacactaaaa 1740 
aacattattt cacctccgtg ggatttcagg gaatttgaag tagaaaaaca gactgcagaa 1800 
gaaacggggc ttacgccatt ggaaacctca aggaaaactc cagattccag accttccttg 1860 
gaagaaacct ttgaaattga aatgaatgaa agtgacatga tgttagagac atctatgtca 1920 
gaccacagca cgtgactcca gtcagtggtc ctggtcccac tgtcccagtg taggaacaga 1980 
gacccgcctt aagagactgg atcgcacacc tttgcaacag atgtgttctg attctctgaa 2040 
cctacaaaat agttatacat agtggaataa agaaggtaaa ccatc 2085 

<210> 71 

<211> 1673 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 011211. 5. dec 

<220> 

<221> unsure 
<222> 188 

<223> a, t, c, g, or other 
<400> 71 

gcttccgagc cggacccaag ggccggggcg tggaggagta gaggggcgag cgcatgcgca 60 
caggactaca cgtcccgaca ggcgtcggga gcggcggccc agttccttgt gggagctgta 120 
gttctgcagg cgcggaagcc gtggtgctcg gccggcagag cactcggttt cccagagggc 180 
tgagcgcncc gcacggaggt gcggcgccgg accaagatgg agactgccga gcagccttga 24 0 
gccgttgagc agctgaacag aggccatgcc gggggcactc cgaggcctga gacgaccacg 300 
cctgtgccgc tgaggacctt catcagggct ccgtccactt ggcccgcttg gctgtccaat 3 60 
cacactccag tgtcaaccac tggcacccag cagccaagag aggtgtggcg tggccctggg 420 
gacgcatggc tgaggcagga acaggtgagc cgtcccccag cgtggagggc gaacacggga 480 
cggagtatga cacgctgcct tccgacacag tctccctcag tgactcggac tctgacctca 540 
gcttgcccgg tggtgctgaa gtggaagcac tgtccccgat ggggctgcct ggggaggagg 600 
attcaggtcc tgatgagccg ccctcacccc cgtcaggcct cctcccagcc acggtgcagc 660 
cattccatct gagaggcatg agctccacct tctcccagcg cagccgtgac atctttgact 720 
gcctggaggg ggcggccaga cgggctccat cc tctgtggc ccacaccagc atgagtgaca 780 
acggaggctt caagcggccc ctagcgccct caggccggtc tccagtggaa ggcctgggca 840 
gggcccatcg gagccctgcc tcaccaaggg tgcctccggt ccccgactac gtggcacacc 900 
ccgagcgctg gaccaagtac agcctggaag atgtgaccga ggtcagcgag cagagcaatc 960 
aggccaccgc cctgggcctt cctgggctcc cagagcctgg gctgccccca ctgactgcgt 102 0 
gtcctccttc aaccaggatc cctccagctg tggggagggg agggtcatct tcaccaaacc 1080 
agtccgaggg gtcgaagcca gacacgagag gaagagggtc ctggggaagg tgggagagcc 1140 
aggcaggggc ggccttggga atcctgccac agacaggggc gagggccctg tggagctggc 1200 
ccatctggcc gggcccggga gcccagaggc tgaggagtgg ggcagccccc atggaggcct 12 60 
gcaggaggtg gaggcactgt cagggtctgt ccacagtggg tctgtgccag gtctcccgcc 1320 
ggtggaaact gttggcttcc atggcagcag gaagcggagt cgagaccact tccggaacaa 1380 
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gagcagcagc cccgaggacc caggtgctga ggtctgagag ggagatggcc cagcctgacc 1440 
ccactggcca ctgccatcct gctgccttcc cagtggggct ggtcaggggg cagcctggcc 1500 
actgcctagc tggaatggga ggaagcctgc aggtggcacc ggtggccctg gctgcagttc 1560 
tgggcagcat cctcccaagc agagaccttg ctgaagctcc tggggtgtgg ggtgtgggct 1620 
ggaagcactg gctccctggt agggacaata aaggttttgg gtctttctga gaa 1673 
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